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Abstract: De Finetti's classical result of [18] identifying the law of an 
exchangeable family of random variables as a mixture of i.i.d. laws was ex- 
tended to structure theorems for more complex notions of exchangeability 
by Aldous [1, 2, .3], Hoover [11, 12], Kallenberg [11] and Kingman [17]. On 
the other hand, such exchangeable laws were first related to questions from 
combinatorics in an independent analysis by Fremlin and Talagrand [20], 
and again more recently in Tao [(>2], where they appear as a natural proxy 
for the 'leading order statistics' of colourings of large graphs or hyper- 
graphs. Moreover, this relation appears implicitly in the study of various 
more bespoke formalisms for handling 'limit objects' of sequences of dense 
graphs or hypergraphs in a number of recent works, including Lovasz and 
Szegedy ['I'l], Borgs, Chayes, Lovasz, Sos, Szegedy and Vesztergombi [17], 
Elek and Szegedy [21] and Razborov [54, 55]. However, the connection be- 
tween these works and the earlier probabilistic structural results seems to 
have gone largely unappreciated. 

In this survey we recall the basic results of the theory of exchangeable 
laws, and then explain the probabilistic versions of various interesting ques- 
tions from graph and hypergraph theory that their connection motivates 
(particularly extremal questions on the testability of properties for graphs 
and hypergraphs). 

We also locate the notions of exchangeability of interest to us in the 
context of other classes of probability measures subject to various sym- 
metries, in particular contrasting the methods employed to analyze ex- 
changeable laws with related structural results in ergodic theory, partic- 
ular the Furstenberg-Zimmer structure theorem for probability-preserving 
Z-systems, which underpins Furstenberg's ergodic-theoretic proof of Sze- 
meredi's Theorem. 

The forthcoming paper [] ()] will make a much more elaborate appeal to 
the link between exchangeable laws and dense (directed) hypergraphs to 
establish various results in property testing. 
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1. Introduction 

This survey paper is about the laws of random colourings of the complete k- 
uniform hypergraph (^) on a countably infinite vertex set S that are invariant 
under permutations of the vertex set. When fc = 1 a complete description of 
these laws follows from a classical result of de Finetti [18, 19] for {0, l}-valued 
exchangeable random variables. More generally, suppose that (if, T^k) is a stan- 
dard Borel space (which will serve as our space of 'colours') and that k>l. We 
shall be concerned with the structure of those probability measures /i on the 

measurable space (A''-'='',Ej^ ) (the set of all AT-coloured complete fc-uniform 
hypergraphs on S) that are invariant under the natural vertex-permuting action 
Symg(S') r\ (^), where Symg(S') is the group of finitely-supported permutations 
of S*. 

Measures (or, equivalently, the associated canonical processes of coordinate- 
projections onto K) enjoying such symmetries were subject to a number of 
studies during the 1970's and 80's, culminating in the first complete analyses by 
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Kingman [47], Hoover [41, 42], Aldous [1, 2, 3] and Kallenberg [44] for increas- 
ingly general classes of process. 

More recently, a similar structural description has emerged independently 
in the work of a group of researchers on 'limit objects' for sequences of large 
finite graphs or hypergraphs, whose structure can often serve as a 'proxy' for the 
'leading order statistics' of such graphs or hypergraphs (see, for example, Lovasz 
and Szegcdy [-VJ], Borgs, Chayes, Lovasz, Sos, Szcgcdy and Vcsztcrgombi [17], 
Elek and Szegedy [24]). 

We shall survey the former area (giving a description close in spirit to those 
of Aldous [3] and Kallenberg [44], where the picture is more complete), and then 
describe how these two strands of research are actually closely related. 

The link between them areas arises because exchangeable random colourings 
can themselves serve as such limit objects, and because once this identification 
is made many of the results of the more recent formalisms simply follow from the 
older structure theorems for exchangeable laws. This basic identification seems 
to appear first in Tao [62], where parts of the older structure theory are then 
implicitly re-proved, but without a development of the full formalism. 

In addition, similar structural results were already obtained by Fremlin and 
Talagrand in [29] (a paper that seems to have gone unnoticed by many more 
recent researchers) for a class of random graphs that are subject only to a rather 
weaker symmetry than full exchangeability: in the terminology we will adopt 
below their random graphs on N are 'spreadable', according to which all induced 
finite random subgraphs on a fixed number of vertices have the same law so long 
as the order of those vertices is respected. This requirement of order- preservation 
demands a more subtle analysis than in the exchangeable case. Spreadability 
was studied by Ryll-Nardzewski [-58] and Kallenberg [44] as a natural weakening 
of the hypothesis of exchangeability, but the work of Fremlin and Talagrand [29] 
also includes an analysis of a related extremal problem (about critical densities 
for finite subgraphs of certain infinite random graphs), in an early precursor 
to more recent work relating such questions in finitary combinatorics to the 
analysis of these random graphs. 

Tao's use of exchangeable random hypergraph colourings as such proxies is 
motivated by a Furstenberg-like correspondence principle between properties of 
finite hypergraphs and those of exchangeable random hypergraphs. He goes on 
to give an infinitary analysis of versions of the graph and hypergraph removal 
lemmas. This makes concrete certain analogies between Furstenberg's ergodic- 
theoretic work and hypergraph-based approaches to proving Szemeredi's Theo- 
rem, and in many respects the present paper is a continuation of this program. 
In [1(J] we will extend the infinitary methods of [62] to give an infinitary account 
of general hypergraph property testing, calling on the structural result described 
in the present paper. 

Another abstract approach to the asymptotic statistics of large dense graphs 
and hypergraphs has recently appeared in work of Razborov [-54], [55]. His con- 
struction rests on the notion of a 'flag algebra', constructed from collections of 
combinatorial structures in a more abstract algebraic manner. Here, too, there is 
a close parallel between the analysis of the infinitary structures that result and 
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the earlier works pertaining to exchangeable random hypergraph colourings. 

In this survey we will first recall versions of the basic results of the the- 
ory of exchangeable laws, and will then examine how various purely combi- 
natorial questions admit a parallel version in the setting of these laws, and 
can occasionally shed light on the original finitary versions through a 'limit 
object' analysis. In the process of describing this link, we will show how the 
various other infinitary formalisms described above all recover essentially the 
same structure as the study of exchangeable processes. We will finish by locat- 
ing these basic underlying structural results in the broader context of ergodic 
theory, where a related but necessarily less complete analysis underlies the fun- 
damental Furstenberg-Zimmer structure theory for probability-preserving sys- 
tems, and so — through a correspondence principle with finitary combinatorics 
similar to that mentioned above — enables Furstcnberg's proof of Szemeredi's 
Theorem. 

A much more thorough account of the theory of exchangeability and related 
symmetries for stochastic processes, as well as its historical development, can be 
found in the book [46] of Kallenberg. The treatment of this theory in the present 
survey will also be skewed to better exhibit its relationship with the more recent 
work in combinatorics, since the versions of the probabilistic results most central 
to this relationship (our Corollary 3.5 and Theorems 2.9 and 3.21) are not quite 
the most general known to probabilists (which are more closely related to the 
setting of partite hypergraphs that we examine in Subsection 3.7). 

Remark. As this paper ncarcd completion, many of the main relations between 
exchangeability and hypergraph theory that it was written to advertise were also 
independently reported in work of Diaconis and Janson [20]. 

2. Preliminaries 

2.1. Background notation and definitions 

Combinatorics 

In this paper we will often be concerned with uniform hypergraphs over some 
countably infinite vertex set S. We shall write (^) for the set of fc-subsets of S 
and (|J (resp. (4)) for U,<fc (f) (resp. U^^, (f)), including (/^) for the set 
of all finite subsets of S. 

We shall denote the subset of tuples (si, S2, • ■ • , Si) G 5* with all Sj distinct 
by Inj([i], S) (a notation that will re-appear in [lo], where injections are conve- 
niently treated as the morphisms of a category of index sets). 

In addition to ordinary fc-uniform hypergraphs, wc will need to consider mul- 
ticoloured hypergraphs. 

Definition 2.1 (Finite palettes). By a finite /c-palette we understand a se- 
quence K = {Kij'l^Q of finite sets. We shall refer to k as the rank of the palette. 
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Remark. Wc shall later have cause to extend the above definition to general 
A:-palettes, whose individual spaces are endowed with a standard Borel structure 
but need not be finite. < 

Definition 2.2 (Coloured hypergraphs). Given a vertex set S and a finite 
set A'fc, we define a X-coloured fc-uniform hypergrapii on 5* to be a map 
H : (^) — > A'fc, and a ii'-coloured fc-uniform hypergraph on 5* witii loops 

to be a map H : Kk that is invariant under coordinate permutations of 

yk^ 

More generally, given a finite fc-palette K = {Ki)^^Q we define a /C-coloured 
hypergrapii on S' to be a sequence Hi of i^i-coloured z-uniform hypergraphs 
on 5 for i = 0, 2, ... , fc. 

We will also need the analog of the above definition in the directed case. 

Definition 2.3 (Coloured directed hypergraphs). Given a vertex set S and a 
finite set Kk , we define a JsT-coIoured directed fc-uniform iiypergrapii on S 
to be a map H : Inj([fc], S) — > Kk, and a iC-coloured fc-uniform iiypergrapii 
on S witli ioops to be a map H : Kk, and more generally given a finite 

fc-palette K = {Ki)^^Q we define a /ST-coloured directed iiypergrapii on S 
to be a sequence Hi of /iTi-coloured directed i-uniform hypergraphs on S. 

Given a if-coloured (directed) hypergraph H on V and a subset of vertices 
W C V we write H\w for the induced (directed) hypergraph on W given by the 
sequence of maps Hili^i. 

If iS" is endowed with a total order <, we shall sometimes write {si < S2 < 
■ ■ ■ < Sji} for a subset {si, S2, ■ ■ ■ , Sn} whose members are understood to have 
been listed in increasing order. 

We shall write SymQ(S') or just Symp for the group of finitely-supported 
permutations of S, and given a subset ICS and some g G Symo(S') we shall 
write g(I) for {g{x) : x £ I}. 

Measure theory and probability 

We assume familiarity with basic measure theory and probability; a suitable 
reference for all the background we will need is Kallenberg [4")]. We recall here 
only some slightly more specialized notions. 

We shall generally work with measurable spaces that are standard Borei 
spaces: that is, that are isomorphic as measurable spaces to a Polish space with 
its Borel cr-algebra. Occasionally we shall use a proper cr-subalgebra of the Borel 
cr-algebra of such a space. The basic properties of these spaces are described in 
Appendix A of [45] (under the shorter name 'Borel spaces'). When we refer to a 
space K as standard Borel, its cr-algebra is to be understood, and will be denoted 
by Sif when necessary; similarly, we shall usually refer to a probability measure 
'on K^ in place of 'on S/f '. The standard Borel assumption has various important 
technical consequences for the management of cr-subalgebras and measurable 
functions: for example, given a probability measure fi on such a space X, we may 
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adopt the common convention from ergodic theory of identifying a cr-subalgebra 
T of Ex with a factor map (f) : X ^ Y into some other standard Borel space 
Y , defined uniquely up to equaUty /^-almost everywhere. When endowed with 
a Borel probability measure, we will refer to the resulting measure space as a 
Lebesgue space; note that point masses are allowed in this convention, so 
these spaces may have atoms. 

The reader who is uncomfortable with these definitions will lose nothing by 
assuming throughout that our underlying spaces are compact metrizable and 
carrying their Borel measurable structure. 

Suppose that Y and X are standard Borel spaces. Then by a probability 
kernel from F to X we understand a function P -.Y y. Ex [0, 1] such that 

• the map y ^ P{y, A) is Ey-measurable for every A e Ex; 

• the map A i-^ P{y, A) is a probability measure on Ex for every y € Y. 

One instructive intuition about such a kernel is that it serves as a 'randomized 
map' from Y to X: rather than specify a unique image in X for each point 
y G Y, it specifies only a probability distribution P(y, • ) from which we should 
choose a point of X. The first of the above conditions is then the natural sense 
in which this assignment of a probability distribution is measurable in y; and, 
indeed, a popular alternative definition of probability kernel is as a measurable 
function from Y to the set Pi X of probability measures on X. We will adopt 
the standard notation P(y, dx) for the measure associated by such a kernel to 
the point y, and will write P : Y X when P is as above. 

Given a kernel P : Y X and a probability measure on y, we define the 
measure P#i/ on X by 



this measure on X can be interpreted as the law of a member of X selected 
randomly by first selecting a member of Y with law v and then selecting a 
member of X with law P(y, ■ ). By analogy with the case of a function between 
measurable spaces, we will refer to this as the pushforward of i/ by P. This 
extends standard deterministic notation: given a measurable function (j) : Y 
X, we may associate to it the deterministic probability kernel given by P{y, ■ ) = 
S^(^y) (the point mass at the image of y under 0), and now P^v is the usual push- 
forward measure (f>^v. 

Certain special probability kernels naturally serve as adjoints to factor maps, 
in the sense of the following theorem. 

Theorem 2.4. Suppose that Y and X are as above, that fi is a probability mea- 
sure on X and that (j) : X ^ Y is a measurable factor map. Then, denoting 
the push-forward (j^^fJ- by v, there is a v-almost surely unique probability ker- 
nel P : Y X such that /i = P^v and which represents the conditional 
expectation with respect to (p: for any f G L^{p^), the function 




xi^ I f{x) P{(j}{xi),dx) 
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is a version of the ^-conditional expectation of f with respect to ^(Ey). We 
call this P the disintegration of ji over <f). 



Motivated by this result, we will sometimes refer to data {Y, P) for which 
II = P^v as a quasifactor of {X, /i), even when it docs not arise as the adjoint 
of some factor map. An alternative convention is to interpret a quasifactor as 
a probability distribution on the set PrX of probability measures on X, where 
Vt X is given its 'evaluation cr-algebra'. This corresponds to our definition by 
identifying v ^VrY and P .Y X with the law of P under v regarded as a 
measurable function Y PrX. A more detailed discussion of quasifactors in 
ergodic theory can be found in Chapter 8 of Glasner [.'>.^>] , where this alternative 
convention is used. 

It is clear that in general a probability kernel need not correspond to a dis- 
integration over some factor map: indeed, if P is the disintegration of p, over a 
map then it must obey the extra condition that its fibre measures P(y, ■ ) are 
mutually singular and concentrated on the fibres (j)~^{y\. Thus, for example, 
letting p : [0, 1]^ ^ (0, oo) be some strictly positive Borcl measurable function 
for which 



gives a probability kernel with all fibre measures mutually absolutely continuous, 
and so this P is not a disintegration of any measure over some measurable map 
[0, 1] [0, 1]. If the probability kernel P does arise as a disintegration over some 
map : X ^ F, we will write that recovers P; if no such exists we call 
P unrecoverable. It turns out that under reasonably general hypotheses the 
mutual singularity of the measures P{y, ■ ) is sufficient, as well as necessary, for 
recoverability, but we will not make use of this fact; see, for example, Theorem 
8.3 in Glasner [35]. We will be interested in certain instances of unrccoverability 
later in this paper. It is worth noting, however, that any quasifactor P :Y ■'^ X 
with p = P^u defines a joining A on (F x X, Ey (g) Ex) of {Y, u) and (X, ^) by 
setting 



and now we can identify both (X, p) and (y, v) as true factors of the larger 
probabihty space (FxX, Ey(g)Ex, A) so that P gives the conditional distribution 
of the second coordinate relative to the first: that is, if Q • ^ "-"^ ^ ^ X is 
the disintegration of A over the first coordinate and ttx : y x X ^ X is the 
projections onto the second coordinate then P = ttx o Q. 

Various natural operations on either measures or functions have analogs for 
probability kernels, and we will need some of these. U P : Y X and S is 



Proof. See Theorem 6.3 in Kallenberg [45]. 



□ 




for all X e [0, 1] and defining P : [0, 1] [0, 1] by 
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some index set then we will write P®^ for the probability kernel Y 
defined by P^^{y, ■ ) := P{y, ■ )®^, the simple 5-indexed product of copies of 
the probability measure P{y, ■ ) (it is routine to check that this retains the 
measurability properties required of a probability kernel). On the other hand, 
if Z is a third standard Borel space and (j) : Z ^ Y and Q : Z ^ Y, we define 
the compositions P o cf) : Z X and P o Q : Z X hy 

Po0(z, .) and P o Q{z, ■) -.^ j^P{y, ■) Q{zAy). 

In the remainder of this paper we shall be interested in various classes of 
stochastic process: a family of random variables tt^ all defined on a single 
'background' probability space, indexed by some set T and taking values in 
standard Borel target spaces if*^*). In many of our examples these target spaces 
K^*^ will all agree; more generally there will be a short list of different target 
spaces i^o, K2, ■ ■ ■ , Kk, with tti taking values in Ki according as t falls in the 
cell of some partition T = Tq U Ti U • • • U T^. 

It is a common and convenient practice in probability to regard the back- 
ground probability space that supports these random variables as 'hidden', and 
to perform all desired constructions in terms of the particular functions tt^ ; how- 
ever, in this survey we shall instead work largely with canonical processes: 
that is, those for which the underlying probability space is itself the product 
X = HtGT = ^0" ^ ^ " ■ ^k'° ^i*'^ product measurable structure 
and some probability measure, and T:t : X ^ K'^*''^ is just the projection onto 
the coordinate indexed by t for each t £ T. Given this projection interpreta- 
tion, we will sometimes extend our notation by writing tt/ for the projection 
of ritGT onto Yitei ^^^^ fo'^ any subset I C_ T. The underlying probability 
measure on X is referred to as the joint law of the process. 

Not only is this picture quite concrete and intuitive for the arguments that we 
will need, but it will also emerge naturally when we turn to the 'correspondence 
principles' relating our work to structures in combinatorics. 

2.2. Exchangeable families of random variables 

Our main objects of study will be families {TTt )teT of random variables with val- 
ues in some spaces of 'colours' Kq, Ki, ... or Kk, and subject to the additional 
hypothesis that their joint law is invariant under some class of permutations of 
the (countably infinite) index set T. Clearly these permutations must preserve 
the partition Tq U Ti U • • • U in order for this to make sense. We will discuss 
later how such processes can serve as 'proxies' for the statistics of some large 
combinatorial structure (such as a dense graph or hypergraph), and for these 
examples such a symmetry assumption will arise owing to an equivalent symme- 
try in the statistics we wish to count: for example, the number of induced four- 
simplices in a three-uniform hypergraph is unaffected by vertex-permutations 
of that hypergraph. 
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Wc will consider difTercnt theories corresponding to different classes of per- 
mutations. If r is a group of permutations of T, preserving the partition Tq U 
Ti U • • • U Tfc where relevant, then following Aldous [3] we refer to the joint 
law fjL of the family {■Kt)t^T as (T, r)-exchangeable if it is invariant under the 
coordinate-permuting action of F: that is, writing : Kq° x K^^ x • • • K^'' 
X K'^^ X • • • k'^'' for the coordinate-permuting action 

Ta{{ut)t^T) - K(t))teT 5 e r, {uJt)t^T e K^^' X Af 1 x • • • J^ 

we ask that fi = {t^)^ii for all g e F. We shall extend this notation for the action 
T to apply to maps between different index sets: if i/' : T — *■ T' is a bijection 
that respects corresponding partitions Tq U Ti U • • • U Tfc and U T{ U • • • U T^. 
then define t'^ : x A'f ^ x • • • A'J^ x K^^ x ■ ■ ■ A' J" (notice the 

contravariance) by 

T'^((^t')t'eT') ('^V'(t))tGT- 

We choose to make both T and F explicit in our nomenclature since many of 
our examples will be obtained from actions of the same group, but on different 
sets. In particular, the leading examples of the theory of exchangeability are the 
following: 

1. Let 5 be a countably infinite set, whose members we shall refer to as 
'vertices', and let T = 5* and F = SymQ(S'), the group of all finitely- 
supported permutations of S.^ Now we may interpret A' as a set (possibly a 
continuum) of 'colours', and a point of as a AT-colouring of the vertices 
in S. A joint law on may be interpreted as a random such colouring 
(with the individual random variables tt^ giving the colours of individual 
vertices), and the exchangeability of a random AT-colouring asserts that 
its law does not depend on the ordering of the vertices. This example 
(albeit not with this terminology) is the oldest and simplest instance of 
exchangeability to have been studied, leading to the complete structural 
description given by work of de Finetti [18, 1!)], Dynkin [21] and Hewitt 
and Savage [40], and was the genesis of most subsequent work; we shall 
review it in Subsection 3.1 below. 

2. Next let S and F be as above, but take for T the set (^) of all k-hyperedges 
of the complete graph on S. Now our joint law corresponds to a random 
A'-colouring of the complete fc-uniform hypergraph on S that is invariant 
under all finitely-supported vertex-set permutations. In Subsection 3.2 we 
shall recount the complete description of these in case k = 2 that follows 
from the work of Kingman [47] , Hoover [42] and Aldous [2] , and shall then 
extend those results to general hypergraph colourings in Subsection 3.3 
following Kallenberg [44] . (Although we note that their work actually cor- 
responds to the more general setting of partite graphs and hypergraphs, 
to which we turn in item 5 below.) 

^We make the assumption of finite support only because working with all permutations 
introduces the additional technicalities of working with an uncountable group; however, with 
the right conventions these are routinely surmountable, and the resulting theory is easily seen 
to be equivalent. 
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3. However, in the course of analyzing the above examples we shall meet a 
natural need to generalize them further, in particular in order to formu- 
late an inductive argument that can be closed. Thus we shall be mostly 
concerned with random A'-eolourings on S for some general fc-palette 
K = {Ki)i^Q, with the above example corresponding to the case in which 
all Ki for i < k — 1 are singletons. It is this setting that motivates the 
above introduction of a possibly nontrivial index set partition, since we 
shall want to write T = (J^^^ T, with T, := (■^). 

4. A different extension of example 2 can be obtained by allowed coloured 
directed hypergraphs: these correspond simply to processes indexed by the 
injections in Inj([fc], 5") rather than the subsets in (^) . As in the undirected 
case, we shall also then need to allow such injections for several different 
values of fc, and so obtain a further theory that amounts to a coupling of 
several Inj([j], S')-indexed processes for different j < k. 

5. Viewing our above examples as random colourings of some combinatorial 
structure that enjoy a certain related symmetry, it is easy to cook up more 
in the same way. The range of new ideas that are needed for their analysis 
seems to be limited, but two in particular are worthy of discussion. 
Firstly, we will consider random colourings of partite hypergraphs. In fact, 
it is this combinatorial setup that corresponds most closely to the origi- 
nal study of 'exchangeable random arrays' in the probabilistic literature: 
consider, in particular, the presentation in Section 14 of Aldous [3]. In the 
simplest case, that of bipartite graphs, we require two countable infinite 
vertex classes 81,82, and then have T = Si x S2 and F the group of per- 
mutations of T obtained by a separate permutation of each Si. A more 
complicated partite example is of interest as it is the natural infinitary 
proxy for the simplex-counting that underlies modern hypergraph proofs 
of Szemeredi's Theorem (see, in particular, Nagle, Rodl and Schacht [53] 
and Gowers [36]). 

Secondly, within a space of 'colourings' such as we may obtain cer- 
tain F-invariant subspaces by demanding various relationships among the 
colours of different points of T, provided these demands are themselves 
permutation-invariant. As a refinement of our earlier directed hypergraph 
examples, these can be naturally formulated in great generality as ran- 
dom models of certain kinds of theory. In fact, these constituted some 
of the first instances of exchangeability for higher ranks to appear in the 
literature, for example in the model theoretic work of Gaifman [34] and 
Krauss [51], and in [42] Hoover cites these works as motivation for their 
further study. More recently, they have appeared appeared implicitly in 
the work of Razborov [54, 55], and in Subsection 4.3 we shall briefly relate 
his formalism to the more classical probabilistic analysis. 

Note that from the above list we have omitted the most heavily-studied in- 
stance of exchangeability of all: stationary processes, for which T = F = Z 
(or, more generally, some other group) acting on itself by (right-) translation. 
As is standard, these stationary processes include (isomorphic copies of) all 
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probability-preserving F-systcms on some abstract standard Borcl probability 
space: the basic objects of ergodic theory. 

However, that setting is, of course, quite wild: such general systems can ex- 
hibit hugely varying kinds of behaviour, most of which seems beyond a sensible 
classification. What is more surprising is that the more special examples in the 
above list all do admit some quite precise structure theorem for the exchange- 
able laws, proved using rather different, more probabilistic arguments than those 
common in general ergodic theory (which lean more towards representation the- 
ory and harmonic analysis) . The closest parallel results in general ergodic theory 
are perhaps those of Furstcnbcrg-Zimmcr structure theory ([30, 3f , 69, 68]), and 
we shall make a rough comparison between our exchangeable families and that 
theory in Section 4.7. 

Let us mention also another symmetry condition for processes closely related 
to exchangeability. Given a law on the product space , we refer to it as 
spreadable if it is invariant under any passage to subsequences: that is, if the 
law iJL of (7re)gg^ri'j is the same as that of (7r0(e))gg^N^ for any strictly increasing 

function : N ^ N. Spreadable processes formed an end-point of the origi- 
nal vein of probabilistic research into exchangeability and related symmetries. 
In [44], Kallenberg showed that for any spreadable set-indexed process (7rc)^gj-N^ 

there is some exchangeable process (7i'0)0ginj([fc].N) indexed by distinct fc-tuples 
in N such that (tTq,) has the same distribution as the subprocess of (tt^) indexed 
by tuples in increasing order, where we identify {ii < 12 < • • • < ik] G (^) with 
(zi,i2, . . . ,ifc) e Inj([A;],N); he then gave a more-or-less complete structure the- 
orem for both classes. He also extended these conclusions to processes indexed 
by the set of all finite subsets of N; this amounts to a spreadable coupling 

of the examples indexed by sets of different fixed sizes. 

Spreadable doubleton-indexed processes also appear, under the name of 'de- 
letion-invariant random graphs', in the independent work [2M] of Frcmlin and 
Talagrand; in particular, theirs may be the first use of this kind of analysis 
to answer a more purely 'combinatorial' question, albeit still about an class of 
infinitary structures. Let us also note in passing that several other names for 
spreadability have also appeared in the literature, such as 'sprcading-invariant' 
(as in Kingman [4<s]) and 'contractible' in Kallenberg's recent book [4(i]. 

We shall discuss spreadability briefly in Subsection 3.9, but will not examine 
in detail the modifications it requires to the exchangeability theory, since these 
lie further from the comparison we wish to make with combinatorics. Kallen- 
berg's paper [44] contains an essentially complete account of this subject. 

In the rest of this introductory subsection we will give some fiavour of the 
structural results that follow by considering cases 1 and 2 above. 

In case 1 , a complete description of exchangeable /i is a classical consequence 
of a theorem of de Finetti: 

Theorem 2.5 (Structure of exchangeable random colouring). Let S he an 

infinite set, K a standard Borel space and fi an exchangeable probability on 
. Then ji can be represented as a mixture of product measures: there exist a 
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Lebesgue space (Zo,^o) and a probability kernel Pi : Zq ~~> K such that 



This result serves as an important and instructive precursor to more general 
analyses, so we will recall a full proof in Section 3.1 below. 

We can rephrase Theorem 2.5 as a 'recipe' for producing all possible ex- 
changeable measures on . We first select some Lebesgue space {Zo,Ho) and 
some probability kernel Pi : Zo K, and now a /i-distributed random member 
of is obtained thus: 

• First, choose some z at random from Zq, according to /ig; 

• Then, from the law Pi{z, ■ ) that corresponds (measurably) to this choice 
of z, choose each coordinate ojg of oj independently with distribution 



How might such a recipe for building an exchangeable measure in case k = 1 
extend to /c = 2? Certainly, one possibility is to choose each edge-colour in K 
independently at random from some fixed probability distribution on K , so 

= i'q^^^ : for example, if K = {0, 1} and !^o{l} = P then this gives simply the 
infinite Erdos-Renyi random graph G(5,p). Next, just as for the case fc = 1, we 

could form a mixture of such measures, so that /i = Vo for some Lebesgue 
space (Zq, /io) and some P : Zq K . 

However, we are now able also to introduce a third possibility. For simplicity 
let us first consider a special case. If G = {V,E) is some fixed finite graph, 
then we may form an exchangeable measure /ion{0,l}(2) by first picking some 
vertex Vs &V for each s £ S independently and uniformly at random, and then 
including each edge {s,t} G (2) according to whether {vs,vt} G E. Thus, we 
take infinitely many samples of the vertices in V , indexed by the set S , and then 
join members of S by edges according to the behaviour in G of the corresponding 
sample vertices, and so form an infinite random expansion of the graph G. This 
certainly has an exchangeable law. This construction gives a fairly simple way 
to convert any fixed finite graph into an infinite exchangeable random graph by 
treating the original graph as a source of sample vertices. 

We may extend to the case of arbitrary K by now ignoring the edge set E of 
G, and introducing instead a if-coloured graph G : K with loops. This 

construction also extends directly to higher-rank hypergraphs; let us make it 
formal with the following definition. 

Definition 2.6 (Sampling random hypergraph). Suppose that H \s a, fixed k- 
uniform iiT-coloured hypergraph with loops on a vertex set V . Then we define the 
//-sampling random hypergraph on to be the law of the random 

/c-uniform A'-coloured hypergraph on S obtained by sampling for each s € S 
independently and uniformly at random a vertex Vs € V and then letting the 
colour of an edge e = {si, S2, . . . , s^} e (^) be H{vsj^,Vs^, . . . , w^J. 
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Note that it was in order to allow for collisions in this list Vs-^ , , ■ ■ ■ , 
of vertices that we required the input coloured hypergraph H to have loops. 

It is this construction of an exchangeable random hypergraph from a fixed 
finite hypergraph that underlies the correspondence principle between statistical 
properties of fixed finite hypergraph colourings and random infinite hypergraph 
colourings. We will take up this principle and some of the parallels that it 
supports between the two settings in Subsections 4.5 and 4.6. 

However, the sampling random hypergraphs do not capture all the possible 
behaviours of an exchangeable random hypergraph: we need to allow for more 
different sources of randomness. For example, if A: = 3, then in addition to sam- 
pling vertices of a fixed 3-uniform ii'-coloured hypergraph wc could consider 
an alternative sampling procedure based on a finite set W of possible edge- 
colours and a symmetric function H : K, by defining the measure 9h to 

be the law of the random hypergraph obtained by first sampling independently 
and uniformly from W a colour We for each edge e S (f ) , and then setting the 

i^-Colour of U € (g) to be H{Wei,We2,We^) when (2) = {wi,W2,'W3}. 

For higher ranks k, the obvious extension of the above idea yields a random- 
sampling construction of exchangeable random coloured hypergraphs corre- 
sponding to each intermediate rank i < k. It turns out that these different sam- 
pling procedures really do, in general, lead to distinct random hypergraphs — 
not every 'edge-sampling' random 3-uniform hypergraph as constructed above 
can be recovered by an alternative 'vertex-sampling' construction — but we 
postpone giving an example that witnesses this until Subsection 3.6, after the 
proof of the main Structure Theorem 2.9, as various elements of that proof will 
ease the analysis of the example. 

Any general structure theorem for exchangeable random hypergraphs must 
allow for all of these different sampling procedures in a single recipe; our main 
result is that, once formulated appropriately, this is essentially all we need. 
The final structure that emerges extends the above sampling construction in 
essentially two ways: firstly by introducing a whole tower of sampling procedures, 
first of vertices, then edges, then 3-hyperedges, and so on up to the full rank k; 
and secondly by allowing each of these sampling procedures its own infusion of 
'additional randomness', by replacing deterministic colourings with probability 
kernels into the relevant space of colours. It is most naturally phrased as an 
extension of the 'random recipe' we gave above as a formulation of dc Finetti's 
Theorem. 

Definition 2.7. By a sequence of ingredients we understand a sequence Zq, 
Zi, . . . , Zk~i of standard Borel spaces, a probability measure ^0 on Zq and 
(setting Zk := K), for each i = 1,2, . . . , k, a probability kernel 

p^: Zqx z'ix zP X • • • X zl^r^^'* 

that is symmetric under the natural coordinate-permuting action of Sym([i]) on 
the domain. 
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Definition 2.8. Given a sequence of ingredients 

by the standard recipe we understand the following procedure for picking a 



r 



andom member of 



• Choose Z0 G Zq at random according to the law /xq; 

• Colour each vertex s € 5' by some Zg E Z\ chosen independently according 



to Pi(2:0, • ); 

Colour each edge a = {s, t\ e (2) by some Za G Z2 chosen independently 
according to P2(2;0, z^, Zj, • ); 



• Colour each (fc — l)-hyperedge u e (^.f 1) by some z^ e Z^^x chosen 
independently according to ^^-1(^0, {zs)s(^u, • ■ • , (^")dg(^^2)' ' )> 

• Colour each fc-hyperedge e S ('^) by some colour in K chosen indepen- 
dently according to Pfc(zo, (•Zs)see, • ■ • , (zu),^^^^.^-^, • ). 

If [I is the law of the resulting random i^-coloured fc-uniform hypergraph, we 
will say that the ingredients yield [i upon following the standard recipe. 

Symbolically the standard recipe may be represented as the following recur- 
sive construction of a family of exchangeable random hypergraphs terminating 
in /i: /io on Zq is given, and then we define 



Ml := (idz„, Pf •^)#Aio on Zo x Zf , 



2 



M2 := I \^ZaMzf^ 09 P2 o (idzo,7r^ap | /xi on x x Z2 



on Zo X Zf X • • • X 
/i = I Pfc o (idz„,7r('e^, . . . ,7r^^.^p I /Xfc_i onA'('"). 

Theorem 2.9 (Structure theorem for uniform exchangeable random hyper- 
graph colourings). For any k-uniform exchangeable random hypergraph fi there 
is some sequence of ingredients 

(Zo, Mo), (^1, Pi), . . . , (^fe-i, Pfc-i), (/^, Pfc) 
which yields fi upon following the standard recipe. 
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Example. In terms of the structure theorem the iJ-samphng random {0, 1}- 
coloured hypergraph for H a fc-uniform hypcrgraph with loops on a finite 
vertex set V has the foUowing ingredients 

({*}), {V, ), ({*}, ), . . . , ({*}, Pf„i), ({0, 1}, Pf ) 

where we write {*} for a one-point space, is the uniform distribution on V ^ 
P.^i is idcnticaUy (5» for i = 0, 1, . . . , /c — 2 and 

Let us note here that the forms of the structure theorems for exchangeable 
random colourings that we have adopted above are not quite the same as those 
favoured in the earlier probabilistic literature on the subject, such as Aldous [3] 
and Kallenberg [44]. In particular, while we have allowed the use of rather ab- 
stract spaces and probability kernels as structural ingredients for our exchange- 
able laws fj,, it has in the past been popular to exhibit a stochastic process with 
a representation in terms of simple ingredients of a different kind whose law is 
/X. For example, the version of Theorem 2.9 that follows most easily from the 
formalism of Kallenberg [44] reads as follows: 

Theorem. For any k-uniform exchangeable random K -coloured hypergraph ^ 
there is some measurable function f : [0, !](*=) —> K such that, if (^a)(jg^ s -j 

is a collection of independent uniform ^,\]-valued random variables then the 
stochastic process (./(Ca)aCe)gg^s'j has joint law fi. 

We shall discuss these older 'representation' versions of our results for graphs 
in Subsection 3.2 and for hypergraphs in Subsection 3.3, but in fact the original 
probabilistic theorems correspond most closely to the setting of random colour- 
ings of partite hypergraphs, and so we shall defer most of our discussion of them 
until we address this setting in Subsection 3.7. This older formalism will also 
re-appear in Subsection 4.2, where we shall find it paralleled very closely by 
the results obtained recently by Elek and Szegedy on 'limit object' representa- 
tions for the statistics of large graphs and hypergraphs: their 'limit objects' can 
simply be identified with function / above, ast 

The forms given for the structure theorems above suggest various questions 
on what further refinements might be possible. We will show in Subsection 3.6 
that in general a simple exchangeable random hypergraph does require type 
spaces of several ranks in its recipe: it cannot be recovered using only a mixture 
of vertex-sampling random graphs for some standard Borel probability space of 
sample vertices. 

A slight subtlety in our means for obtaining our structural ingredients is the 
following. Given an exchangeable random hypergraph colouring /i, we might ask 
whether it can be that there not only exists some suitable collection of ingre- 
dients {Zi, Pi) yielding ^, but that the probability kernels of these ingredients 
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can be recovered as factors of the original probability space That IS, 

can we obtain our products of the type spaces Zi and the probability kernels Pi 
from some tower of factors 

— > (^1 — ^ (2^0, Mo), 
where (setting '■— m) the image measure 

Mi := • ■ • (0fc-2)#(0fc-i)#M 

is precisely the exchangeable random i-uniform hypergraph Z^-colouring ob- 
tained by stopping our recipe at level i? 

This is possible in the case of de Finetti's Theorem 3.1 (that is, k = 1), 
where one option is to obtain relative independence over the (SymQ(S')-invariant) 
'tail tr-algebra' of the original system. However, it turns out to be impossible 
in general if fc > 2: in Subsection 3.6 we will also present an example of an 
exchangeable random graph for which we must use unrecoverable quasifactors. 



2.3. Relations to combinatorics: correspondence principles and 
limit objects 

There are certain theorems of combinatorics that can be conveniently approached 
by first converting the original combinatorial data into a related kind of stochas- 
tic process, and then applying to that process the more analytic methods of 
probability or ergodic theory. The successes of this approach arc perhaps most 
striking in arithmetic combinatorics, and in particular in the ergodic-thcoretic 
approach to Szemeredi's Theorem discovered by Furstenberg in [.:>()] (for an 
overview of these relationships, see also Bergelson [15] and Tao and Vu [64]). 
Indeed, several more general results of density Ramsey theory in the arithmetic 
or related settings are still known only by such methods. 

The machinery of exchangeable random hypergraph colourings stands in a 
similar relation to finitary hypergraph theory as does the ergodic theory of In- 
actions to arithmetic combinatorics. While it is not clear that this 'correspon- 
dence principle' can have such powerful consequences for finitary hypergraph 
theory as for its arithmetic counterpart, there are certain kinds of combinato- 
rial question for which parallel versions can be set up for exchangeable random 
hypergraph colourings which may then either be instructive or interesting in 
their own right. We shall give a brief overview of these parallels here, and will 
then examine two particular examples, coming from property testing and ex- 
tremal combinatorics, in Subsections 4.5 and 4.6. 

Let us first describe informally how this 'correspondence principle' might 
arise. Suppose we are given a combinatorial structure T of very large finite size 
and a dense subset E <Z T. Suppose further that the structure T has a natural 
group r of symmetries, and that we are interested in the 'statistics' of the subset 
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E that count the number of eopies of a fixed (small) pattern L C_ T lying inside 
E: that is, the number of g G F with g{L) C E. Then we can form from E an 
associated random subset E oi X, by setting E := g{E) for a symmetry g G F 
chosen uniformly at random; letting P be the law of E in it is clear that 

this is F-invariant for the canonical action of F on For each t € T, we 

can consider the indicator function of the event At :— {E 9 t}; evaluation of 
all products of these indicator functions for different t E T describes completely 
the law P. 

The count we wish to make of those g such that g €i T, upon renormalizing 
by |F|, is now just the probability P(E 13 L). It may be that some of the 
properties of these probabilities can be established by studying the possible 
structure of general such F-invariant laws P on {^j^^), in particular, sometimes 
the asymptotics of these properties may be more easily analyzed by considering 
a nested sequence Ti C r2 C . . . of larger and larger index sets, and then 
defining from the associated invariant laws P„ some vague limit P on (^^) for 
a suitable countably infinite index set Too- 

The correspondence principle of interest to us in the coloured hypergraph 
setting corresponds to the above outline in the case that T is the complete 
fc-uniform hypergraph on some large finite vertex set Sn and F is the group 
of automorphisms of T given by vertex set permutations; after taking a vague 
limit as above, we are naturally left with an exchangeable random fc-uniform hy- 
pergraph (equivalently, a {0, l}-coloured hypergraph) on the countably infinite 
vertex set S := Un>i ^n- 

It is this relation that leads to the overlap between counting substructures 
of finite coloured hypergraph and exchangeable random coloured hypergraphs. 
In view of this it is only to be expected that various properties relating to the 
first-order statistics of these finite objects have analogs for their infinite random 
counterparts, and that arguments relating to these properties are often skeletally 
similar in the two categories. 

In the setting of arithmetic combinatorics, a slightly more delicate such cor- 
respondence principle underlies Furstenberg's analysis of densities of arithmetic 
progressions in a dense subset of {1, 2, ... , N}. These are translated into prob- 
abilities of multiple recurrence in probability-preserving systems, the analysis 
of which then leads to the ergodic-theoretic proof of Szemeredi's Theorem. Of 
course, in Furstenberg's case the translation to an infinitary result made the 
proof of the finitary result much more manageable through efficiently hiding a 
large number of intricate estimates required in a finitary version of the argument 
(consider, in this relation, Tao's finitarization of the proof in [63]). This situa- 
tion is surely rather rare; in the hypergraph setting, it seems more common that 
where parallel versions of theorems are available, their proofs are of comparable 
sophistication on either side of the divide. 

We should stress that many finitary questions do not have a sensible infini- 
tary analog, since in order to do so they must take a sensible 'asymptotic' form 
as the orders of the hypergraphs in question tend to oo. Among those that do 
exhibit this form, the infinitary version of the question typically bears only on 
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the finitary version up to 'leading-order'. For such questions, we are often able 
to sharpen the finitary version to ask for higher-order information — perhaps 
by asking for estimates on rates of convergence, as opposed to the mere fact 
of convergence — that has no clear infinitary analog: O(y^) is more than just 
o(n). Indeed, in some cases it is only after raising the stakes in this way that 
the finitary question seems interesting, and so the infinitary version may appear 
somewhat degenerate. For example, it is hard to imagine a modification of The- 
orem 2.9 that can be brought to bear on the finer Turan problem for bipartite 
graphs (for which the leading-order density is 0, and this is already the answer 
to which the infinitary approach pertains) or on the problem of Zarankiewicz 
(see, for example. Chapter IV of Bollobas [16]). 

Two specific arenas that do allow a parallel study of finite hypergraphs and 
exchangeable random hypergraphs are questions of efficient property testing (to 
be considered in Subsection 4.5, and at much greater length in the forthcoming 
paper [f 0]) and extremal questions (to be considered in Subsection 4.6). To date, 
there are some early indications that this parallelism may occasionally bear fruit 
in the study of finitary questions, such as recent results of Razborov [54, 55] on 
an instance of the graph-copy problem, although it is surely too early to judge 
whether a trend may emerge. 

Insofar as the structure theory for exchangeable random coloured hyper- 
graphs is of use in proving results in the above settings, in principle the ways in 
which it can be used should be replicable purely in terms of counting substruc- 
tures of the original finitary graphs or hypergraphs. The benefit to be derived 
from an examination of the infinitary latter machinery may lie largely in the 
more convenient organization of such finitary arguments in the future, rather 
than in forging a new way actually to write out proofs. In place of the infini- 
tary structural analysis of exchangeable random hypergraphs, purely finitary 
arguments tend to rest heavily on Szemeredi's graph regularity lemma (see, for 
example. Sections IV. 5 and IV. 6 of Bollobas [16]) and its later hypergraph ver- 
sions developed by Nagle, Rodl and Schacht [53] and by Gowers [36]; these serve 
as a kind of 'structure theory' in the finitary category, and show some interesting 
similarities with the exchangeability theory (although deriving either from the 
other does not seem quite straightforward). We shall discuss this comparison a 
little further in 4.4. 

3. Exchangeable families of random variables 

We give an analysis of many of the instances of exchangeability listed above, and 
provide some examples to illustrate different phenomena that can arise in the 
process. We have tried to arrange our particular examples in roughly increasing 
order of generality and sophistication. 

3.1. Warmup: de Finetti's Theorem 

We first recall a version of de Finetti's Theorem ([18, 19]) and its use to deduce 
a structure theorem for the special case fc I of the setting of random hyper- 
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graph colourings. This serves as helpful motivation for later, more complicated 
instances. 

Since our vertex set S is infinite, it can be partitioned into two infinite subsets, 
say as 5 = 5*1 U S'2. Now observe that within the canonical family of random 
variables {■Ks)s^s, the subfamily {tIs)s£Si has the same joint law under any 
bijective identification of S with 6*1, say ip : Si ^ S . Indeed, it suffices to check 
this assertion for the finite-dimensional projections: if si,S2,...,Sfc G Si and 
Ai,A2, ...,Ake Sa- then 

{■Ks^)#^^{^^s^ e Ai,7rs2 <E A2,...,TTs^ e Ak} 
=^^{7rsi e Ai,TTs^_ e A2, . . . ,7rs^ e Ak} 

= e ^1, 7r^(s2) G ^2, • • ■ , 7r^.(sfc) e Ak} by exchangeability 



as required. 

Therefore it suffices to prove our structure theorem for the projection (tts^ 
instead of for the original measure /i. This sleight of hand now allows us to use 
the remaining random variables {TTs)seS2 'poll vertices': we shall deduce the 
required form for the joint law of {7rs-^)^fJ- by showing that the {'Ks)seSi are 
conditionally independent given the behaviour of the (TTs)seS2- 

It turns out that with a little more care it is possible to do without this de- 
composition of S, effectively by letting 6*2 D 6*3 D • • • be a descending sequence 
of infinite-coinfinite subsets of S with empty intersection, and conditioning only 
on the tail tr-algebra ni>2 '''("'Si)- it turns out that this retains 'enough infor- 
mation' about the joint behaviour of the individual tt^ that wc can still prove 
the necessary conditional independence. However, the analog of this argument 
fails in the more complex cases of exchangeability to be considered later (we 
will justify this with an example in Subsection 3.6), and so wc shall not pursue 
it further here. 

Thus, writing Zo := K^^ and letting /ig G Pr Zq be the projection (Trs^):^^ 
of /.J onto the auxiliary coordinates, these will themselves be our structural 
ingredients. 

Theorem 3.1 (Variant of de Finetti's Theorem). The variables tTs, s G Si, 
are relatively independent over ttsj and are such that for any f e L°°{K) the 
conditional expectations [/ o tTs | TTg^ ] are ( almost surely ) equal for all s G Si. 

Proof. An essentially analogous proof but using the abovementioncd tail a- 
algebra can be found under Theorem 11.10 in Kallenbcrg [45]. Suppose that 
F = {si, S2, . . . , Sr} C 5*1 and that we are given /i, /2, . . . , /r € L°°[K). We 
need to show that 



{T'^')#^i{T:st e Ai, 7r,,2 € A2, . . . , 



T^Sk G Ak}, 
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By induction on r it clearly suffices to prove that 



_i<r 



and hence that for any A G cri'^S2) have 



/ TT /i O T^s, = / E^[/i O TT^. I TTsJ • TT /j O TTs; dpi. 

^^»<. -J A V,=2 / 

By the approximability in /i of any Borel subset of K^'^ by a finite-dimensional 
cylinder set, we may assume that A depends only on coordinates in some finite 
E C 82- Therefore, choosing coordinates S2, S3, . . . , sj, G 6*2 \ E, by exchange- 
ability we know that 

/ Yi fi° ^s, dfi= fl° T^si ■ [ Yl ° '^s' ^M, 

and now both A and the functions ft o tt^/ , i = 2, 3, . . . , r, depend only on the 
coordinates tTs for s € S2, so certainly we may replace /i o tTs^ with E^[/i o 
TTsj I TTSj] in the right-hand-side above: 



d/i. 



i<r 



This is almost what we want: it remains only to move the vertices s'^ back to 
Si to leave the desired equality. However, this requires just a little care: while 
we certainly still have exchangeability to appeal to, we must make sure that the 
conditional expectation E^[/i o tt^^ | tts^] is actually independent of the coordi- 
nates TTg^ , TTs^ , . . . , TTsj^ , SO that upon swapping them back with tTs^ , tt^, , . . . , tt^^ 
this conditional expectation is unchanged. 

This is so because for any infinite S'3 C ^2 \ (i? U {s'2, Sg, . . . , s^}), we know 
that {7TsiUS3)#fJ- is just another copy of /i, and hence, in particular, that 

II 1 1 2 1 1 1 1 2 

||E^[/i0 7r,i IttsjJII^ ||E^[/i0 7r^^ KsslL- 

Since, on the other hand, 

E^[/l O TTs, I TTs,,] = E^ [E^[/l O TTs, | TTsJ | T^Ss] , 

this equality of L^-norms can hold only if the two conditional expectations are 
actually equal: 

E^[/l O TTs, I TTS.,] = E^[/l O TTs, | TTg J . 

We will sometimes refer back to this as the 'tail property' of these conditional 
expectations. 

It follows, in particular, that E^[/i07rsj | Trg^] does not depend on tt^^, tt^^, . . . , 
TTs' ] SO it is invariant under the coordinate-permutation that swaps (s2, S3, . . . , s^) 
with (sj, S3, . . . , sj,), and we are left with the desired equality. □ 
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Remark. It is clearly essential for this proof that S be infinite, and in fact it is 
not hard to construct examples of measures enjoying the analogous symmetry 
with finite S for which the conclusion fails. < 

We can now deduce Theorem 2.5 immediately. 

Proof of Theorem 2.5. Conditioned on -ks^ ■ Zq, the measure {nsi)#fJ. 

disintegrates as an integral of fibre measures; and by the above these are all 
product measures, giving the desired form 

JZo 

where for Pi : Zq K we simply take the conditional law of any individual one 
of the random variables tt^ for s £ Si over Trg^ ■ By the equivalence between fj, 
and (TTs-i)^fJ' described previously, this completes the proof. □ 

We will later return repeatedly to the above trick of considering the structure 
of the projected measure (tttJ^/^ instead of that of ^ itself for some suitable 
infinite subset Ti of the index set T, in order to set at our disposal an additional 
infinite pool of random variables {TTt)t^T\Ti corresponding to 'reference vertices'. 
In the coloured hypcrgraph setting with T ~ (^) we will take Ti :— (^^) for 
some infinitc-coinfinitc Si C 5, and will then refer to (Tr^-sip^/x as the induced 

random hypergraph on Si, by analogy with the usual combinatorial term for 
an induced subhypergraph. This idea of selecting some infinite subset of vertices 
to 'poll' when analyzing the part played by other vertices appears explicitly in 
Tao's approach to an infinitary hypergraph removal lemma ([()2]). 

3.2. Exchangeable random graph colourings 

Wc will now consider the slightly more complicated setting of exchangeability 
for random graph colourings. Naturally, it will be subsumed by the treatment 
of hypcrgraphs of arbitrary rank in the next section. 

To prove Theorem 2.9 in the special case k = 2 we must extract from a 
given exchangeable random X-colouring of (2) an exchangeable random Zi- 
colouring of S for some suitable auxiliary colour space Zi . As in the case of de 
Finetti's Theorem, this Zi will emerge explicitly from a decomposition of 
corresponding to a partition of S into two infinite subsets, S = 5*1 U 5*2. Our 
first step, then, is to write 

X(f) = X X^ix^^ X = X (K'^^-f' X 

where we identify K'^^^i'^y- |enSi|=|enS2|=i} .^j^^ ^-Sixs^ ^nd then with {K^^f^ 
in the natural way. 
Now let Zi := if^^ x 

and for s £ Si write tt^^ for the composition of 
the projection 7r/{s}xS2\ : A'^^) K^^'^i ° 2^^)' "^^J' x K^^^) with the natural 
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identification of the first of these factors with Zi . It is clear that 7rf ^ is invariant 
under for any t/ S SymQ(S') that fixes {s}U5'2, that if s G 5i and g G SyinQ(5i) 
then Trfi o = T^g^^y and that the a-subalgebra VsgSi ^(^?^) SyniQ(S'i)- 
invariant. In addition, for any / € L°°{K) we have 

E^[/o7r,j7rf] = E^[/o^,,orf Kf] 

whenever ei, e2 S {^2) share the vertex s and g S SyniQ(5i) fixes s and sends 
62 onto ei: this follows from the symmetry and the fact that tt^^ is fixed by any 
such permutation g. 

Lemma 3.2. The random variables {TTf^)s£Si ^'^s exchangeable. 

Proof. This is a direct verification: if si, S2, . . . , Sk G 5*1, 5 G Symg(5'i) and 
Ai G a{7rf.^) for each i < k, then 

\i<fc / \i<k ) 

(by the permutation invariance of//); but if = (7r^i)~^(i?i) with G for 
each i <k then the sets involved in the two sides of this equation are precisely 
n.<fc{^s^' e and n.<fc{^f(s,) e respectively. □ 

Now we turn to the main estimates that will underly our structure theorem. 

For convenience, let us write T(s) for the cr-subalgebra (7rfi)^^(I]2j C 
and set T := T^^ := V^gSi T(s)- 

Note that we could repeat the whole of the preceding discussion but defining 
the maps Trf^'^^ using only s and vertices inside an infinite subset 6*3 C ^2, 
rather than all of 6*2; we will need to have this modified construction at our 
disposal also, and so write T'^'' for the a-subalgebra obtained just as was T'^^ 
but from these more restricted factor maps tt^^^'^^ (using only the smaller infinite 
subset 5*3 C S2 as the set of auxiliary vertices). 

Lemma 3.3 (Tail property of conditional expectation). Whenever 6*3 C ^2 is 
a further infinite subset, e G (^^) and f G L°°{K), the conditional expectation 
[/ o TTg I T] is actually measurable with respect to T"^^ . 

Remark. This assertion is analogous to the 'tail property' of the conditional ex- 
pectations that we used in the last step of the proof of de Finetti's Theorem 3.1. 
However, whereas in that case this property is actually enough to justify the 
use of the tail cr-algebra (as we remarked after the statement of the theorem), 
here this next step fails: the point is that we would now need to take an inter- 
section of the form n,;>2 ^ ^'^'^ some decreasing sequence 52 3 6*3 3 . . . 
of infinite sets with empty intersection, but we cannot in general then equate 
this with (ni>2'^'^') ^ (nj>2'^^') (even up to /i-negligible sets), as wc would 
need to do in order to work with a separate tail tr-algebra for each vertex s € S. 
In Subsection 3.6 we will see an example that witnesses this. < 



T. Austin/On exchangeable random variables 



102 



Proof. This follows from exchangeability and a simple 'energy squeeze' argu- 
ment. Since T'^^ C T'^^, by the iterability of conditional expectation we certainly 
have 

E,[f o I T^-^] = [E^[/ o TTe I T^^] I T^^] , 

and so, in particular, E^[/ o | T'^^] is the image of Ep[/ o ir^ \ T^^] under an 
orthogonal projection in L^{iJi). It will therefore follow that they are equal if 
they have the same norms in L^(/i). However, we know that if e = {s,t\ then 



|E^[/o^,|T^^] 



sup ||E^[/o7re I (T{{■Ka)^^IU}yJF^)\/a{{^la)^^({^}uF^)] 
■Sn finitn \ 2 J \ 2 J 



FCS2 finite 

and in the expressions within this supremum the value of 



depends only on \F\, since by exchangeability we may choose a permutation of 
5* that fixes and swaps F with any other subset of ^2 of the same size, and 
this will act as an isometry on L^{n)- Since S3 is still infinite, and so contains 
arbitrarily large finite subsets, it follows that also 



|E^[/o^,|T^^]||^ > ||E^[/o7re|T 



2 

2 - ll-MLJ - -e I - j||2 

> sup ||E^[/o7re|cr((7rQ) .{.}uF0Vcr((7ra) .{nuFO]|| 

FCS2 finite ^ ) afel 2 ; II 

^ ||E^[/0 7re|T^^]||2, 

and so the desired energies must be equal and the proof is complete. □ 

Proposition 3.4 (Relative independence). The random variables iTe for e S 
2^) are relatively independent over T; for any finite F C (^2) ^'^^ bounded 



measurable functions fe'-K 



for e £ F we have 



LegF 



T 



= []E^[/,o^e|T] 



e(EF 



Proof. Enumerate F as {ei,e2, . . . ,6^}. By induction on r it suffices to show 
that 



p=i 



T 



E^[/ei OTTei I T] ' E^ 

This identity asserts that whenever A G T we have 



n ° 

p=2 



T 



/ Jl /e O d^ = / E^[/ei O TTgi | T] • E^ 



Lp=2 
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and so, since both A and E^[/ei o TTg^ | T] are T- measurable, this will follow if 
we only prove that 

/ n /e O 7I"e d/^ = / E^[/ei O TTe^ | T] • J]^ /e^ O TTe^ d/i. 

By continuity of expectation, wc may restrict our attention to the case of ^ a 
finite-dimcnsional cylinder, say A — AinA2n- • -HAm for G (^iiT^e^^^^ivjyuRj^) 

for some yj £ Si and finite Rj C S2, I < j < m. 

Let ei = {si, S2} and enumerate all the remaining vertices in ({yi, 1/2, ■ • ■ , 2/m}U 
ei U 62 U • • • U Gr) \ 6; as zi, zi. . . . , Zq. Select distinct vertices wi, W2, ■ • ■ , 
G S'2 \ U i?2 U • • • U and let g £ SymQ(S') be the permutation that 
swaps Zi and Wi for alH < g. 

By the exchangeability we know that 

/ Y[feOTTedn= / Jl fg{e)OT^g{e) df^ = / /ei OTTei • J| /g(ep) OTrgfe^) d/X, 

JA^^p Jr^(A)^^p JtO(A) p^2 

since g does not move si or S2. But now both t^{A) and 0^=2 /s(ep) °''''s(ep) are 
clearly T-measurable, since through the action of g we have moved all relevant 
vertices from Si into 5*2, except for si and §2; in particular, every edge g{ej) for 
j ^ i now has at least one end-point in 6*2. Therefore the left-hand-side above 
must equal 

/ E^[/ei O TTeJ T] • TT /^f^^) O Tr^f^^) d//. 

It remains to put the vertices we have just moved back to their original positions. 
As in the proof of de Finetti's Theorem, we cannot quite naively re-apply the 
permutation since the cr-algebra T = T^^ is not fixed under g. However, 
it turns out that the conditional expectation E^[/ej o tt^^ | T] does enjoy this 
invariance; for the infinite subset 5*3 := 5'2\(i?iUi?2U- • ■yjRm'^{wi,W2, ■ ■ ■ , Wq}) 
(and so also the cr-algebra T"^^) is invariant under g, and by Lemma 3.3 we know 
that we may replace T'^^ with T"^^ in the conditional expectation of interest. 
Therefore re-applying g after making this replacement shows that the above 
integral is equal to 



J A , T 



Now switching back from T'^^ to T'^^ completes the proof. □ 

Corollary 3.5 (Structure theorem for random coloured graphs). For any ex- 
changeable random coloured graph fj. there are ingredients (Zq, (Iq), {Zi, Pi), 
(A', P2) which yield fi upon following the standard recipe. 
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Proof. As remarked previously, we may easily find an isomorphism between ^ 
and the infinite induced random coloured subgraph (tt^si^)^I-I, and so we may 
prove the result for the subgraph and then simply pull the whole structure back 
under composition with some bijection ijj : S Si. However, Lemma 3.2 and 
the argument of Proposition 3.4 show that our projections {TTf'^)seSi give the 
desired relative independence properties for the edge colours in 5*1 and that 
their joint law is an exchangeable law on Zf ^ . Now de Finetti's Theorem 3.1 
allows us to disintegrate the exchangeable measure fii further over some other 
Lebesgue space (Zq^^q), as described, completing the proof. □ 

We note here that the form of our Structure Theorem 2.9 is somewhat dif- 
ferent from that appearing in the original probabilistic literature on exchange- 
ability. For example, the analog of Theorem 3.5 that follows most directly from 
a suitable specialization of Theorem 14.11 in Aldous [.'!] is the following. 

Theorem 3.6. For any exchangeable random K-coloured graph fi there is some 
measurable function f : [0,1]"* K such that given any uniform [Q,l\-valued 
random variable and uniform [0, l]-valued processes (^s)ses '"^'^ (^e)eg^s^; 
independent, then the stochastic process (/(Coi (jg^^s^ has joint 

law ^. 

Thus, this result represents the joint law /x using the rather different ingre- 
dient of a measurable function [0, l]^^^) ^ j{ However, it is relatively easy to 
move between these two versions of the theorems using standard results on re- 
placing probability kernels with dependence on additional independent random 
variables; we return to these matters in more detail and exhibit the necessary 
techniques in Subsections 3.7 and 4.2. 

3.3. Exchangeable random hypergraph colourings 

We will now turn to the full version of Theorem 2.9. We begin by introducing 
some more notation; this will not only be important for the organization of the 
present subsection, but will recur in the proof of the positive results of [!()]. 

The skeleton of our proof is largely as in the case of graphs; however, the 
reduction to de Finetti's Theorem for the last step of the proof of Corollary 3.5 
now grows into an induction on the rank k. Importantly, even if we begin our 
analysis in the case of a uniform exchangeable random hypergraph colouring, 
say n G k('') for some fixed k, the inductive step of the argument will generally 
introduce a non-uniform rank-(A: — 1) hypergraph colouring on a space of the 

form Zq X Zf x • • • x .^jl^Tj^ for some nontrivial auxiliary standard Borel spaces 

z,. 

It is simplest to work in this larger category from the beginning, and so let 
us first make the obvious extensions of Definitions 2.1 and 2.2 to this purpose: 

• For k > 1, by a fc-palette we shall understand a sequence K = {Ki)^^Q 
of standard Borel spaces (not necessarily finite sets), and shall refer to k 
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as the rank of the palette as in the finite case of Definition 2.1 

• Given a vertex set 5' and a general fc-palette K = {Ki)'^^^ we shall define 
a i^-coloured hypergraph on 5' to be a sequence Hi of Xi-coloured 
i-uniform hypergraphs on S. 

Let us also adopt in this subsection the notation 
for the space of all /iT-coloured hypergraphs on S. 

Definition 3.7 (Exchangeable random coloured hypergraphs). Given a fc-palette 
A', by an exchangeable random A'-coloured hypergraph on S we un- 
derstand a Borcl probability measure 11 on A'^'^) that is invariant under the 
coordinate-permuting action r of SymQ(5) defined by 

^'((^eU(|j)=K(e)U(|,)- 

It is clear that for each i <k the projection of onto the factor is an 

exchangeable random i-uniform ATi-coloured hypergraph /^i; the overall measure 
jjL may simply be regarded as a SymQ(S')-invariant coupling of these ^i. 

Having set up this slightly more general class of spaces, we shall deduce the 
Structure Theorem 2.9 from a generalization to the non-uniform case, relying 
on suitable extensions of Definitions 2.7 and 2.8. 

Definition 3.8. By a sequence of ingredients we shall now understand a 
sequence Zq, Zi, . . . , Z^-i of standard Borel spaces, a probability measure 
on Zq, a Borel map Ki : Zi ^ Ki for each i — 0, l,...,fc — 1 and (setting 
Zk := K) a probability kernel 

P^■. ZqX Zlx zP X • • • X Z^^i'^ Z, 

for each i = 1,2, . . . , k that is symmetric under the natural coordinate-permuting 
action of Sym([i]) on the domain. 

Definition 3.9. Given a sequence of ingredients 

{Zo,kq,^o), (Zi,ki,Pi), . . . , (Zfe_i,Kfc-i,Pfe-i), {K,Pk), 

by the standard recipe we understand the following procedure for picking a 

random member of K^^^: 

• Choose Z0 e Zq at random according to the law /io, and set tj^ ko(z0); 

• Colour each vertex s G 5 by some Zg G Zi chosen independently according 
to -Pi(2;0, • ), and set LOg := ki(zs) 

• Colour each edge a = {.s, t} G (2) by some Za G Z2 chosen independently 
according to P2iz^,Zs, Zt, ■ ), and set uja := K2{za); 
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• Colour each (fc — l)-hyperedge u e (^-f i) by some G Zk-i chosen 
independently according to Pfc_i(z0, (zs)sg„, . . . , •), and set 

• Colour each fc-hyperedge e e {^) by some colour uj^ & K chosen indepen- 
dently according to Pkizin, {zs)see, • • ■ , • )• 

If fi is the law of the resulting random X-coloured fc-uniform hypergraph, we 
will say that the ingredients yield fi upon following the standard recipe. 

Thus, in this non-uniform version of the standard recipe, we retain some of 
the information contained in the choice of z„ for each u € (^^) (specifically, its 
image k\u\{zu)), whereas in the fc-uniform case we threw all of this away at the 
last step save for the top-rank data {u}e)^^(^sy 

Theorem 3.10 (Structure theorem for exchangeable random hypergraph colour- 
ings). For any exchangeable random K -coloured hypergraph fi there is some se- 
quence of ingredients {Z^, no, IJ-o) , {Zi, ki, Pi), . . . , {Zk-i, Kk-i, 
Pk-i), {K,Pk) which yields ^ upon following the standard recipe. 

Let us begin our analysis. Suppose that K = {Ki)i^o ^ fc-palette, and again 
partition S into two infinite subsets, say 5*1 and S2', as in the cases fc = 1 and 
fc = 2 it suffices to prove the structure theorem for the projection (7r^sip#/i. 

Given this partition, we write 

i=0 i=0 i=0 

where we identify e e (f) having 16 511=1 with (e n S*!, e n S'2) € {^l) x (j.^.) 
in the obvious way. Note that we are currently going to use this factorization 
only for the top rank fc. 

Now let Yi := K^!"^ for each i < fc — 1, and write tt^* for the projection 

from ' onto the coordinate indexed by a € ( / )■ 

Considering the maps tTq for a S (^J,.) and also tt^' for a € (<fii), we can 
obtain from a /T-coloured hypergraph on 5 a hypergraph on 5*1 coloured by the 
palette K := [Kq x Yq,Ki x Yi, . . . ,Kk-i x Yk-i,Kk)- We will show that for 
the if-coloured random hypergraph on Si obtained from ^ by pushforward, say 
/Et, the Kk colours TTe of the fc-hyperedges e € ('^^) are relatively independent 
conditioned on the {Ki xYi) colours of all the sub-edges a C e. Thus, we will have 
obtained a random rank-(fc— 1) coloured hypergraph fi with enlarged palette K 
such that the original law fj, can be obtained as the image 

M= (KO,Kl,...,Kfe_l, (g) ^fe° Ve,( = )....,(,^)) „/^' 

where Pk may be obtained explicitly as the conditional distribution of any one 
of the projections TTe for e £ (■^^) over tt^ ^ ^ and Ki : Ki x Yi Ki is the 
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projection onto the first coordinate. This wiU become the top-level kernel 
appearing in our list of ingredients, and K^-i x Yfe^i will play the role of Zk-i- 
We will then be able to complete the proof by induction on fc, since we may 
make an inductive appeal to the structure theorem for the rank-(fc — 1) random 
hypergraph colouring {irf to obtain the remaining ingredients. 

[<k-lj 

For convenience, let us write T{u) for the (T-subalgebra of generated 

by the maps (tTq, tt^'"' )actii and set T := V ^( si \ T(it). Just as in the graph 

case, we shall also want to use the cr-subalgebra T'^=* obtained analogously to the 
above but using only the smaller infinite subset 5*3 C 15*2 as the set of auxiliary 
vertices. 

An energy-increment argument exactly analogous to that for Lemma 3.3 gives 
the following. 

Lemma 3.11 (Tail property of conditional expectation). Whenever S3 C S2 is 
a further infinite subset, e G (^^) and f € L°°{K), the conditional expectation 
[/ o TTg I T] is actually measurable with respect to T"^^ . □ 

Proposition 3.12 (Relative independence). The random variables Tie for e S 
('^^) are conditionally independent over T.' for any finite F <Z (■^^) and bounded 
measurable functions fe-K^M. for e G F we have 



Em 



fe O TTe 



T 



eeF 



Proof. Enumerate F as {ei, 62, ... , e^}. Arguing just as for Proposition 3.4, it 
suffices to prove that 

/ n /e O TTe d/i = / E^[/e, O TTe J T] • J]^ fe^ O TTe^ dfl 
''^eeF p=2 

for any finite-dimensional cylinder A = AinA2n- ■ -nAm for Aj S (T((7r„) ^(ajURj\) 

"^l <fc ) 

for some aj G and finite subsets i?j C 6*2, 1 < j < m. 

Let ei — {si, S2, . . . , s^} and enumerate all the remaining vertices in (ai U 
02 U • • • U am U ei U 62 U • • • U Cr) \ Ci as ^1, Z2, . . . , Zq. Select distinct vertices wi, 
W2, . . . , i«g S 5*2 \ [Ri U i?2 U • • • U Rm) and let g £ SymQ(S') be the permutation 
that swaps Zi and Wi for all i < q. 

By the exchangeability we know that 

since g does not move any vertex of ei. Now both t^(A) and np=2 /s(ep) °''^g{e^) 
are T-measurable, since through the action of g we have moved all relevant 
vertices from Si into S'2, except for those in ei; and |ei H Cpj < k — 1 for any 
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p > 2. In particular, every edge g{ep) for p > 2 now has at least one end-point 
in 52- Therefore the left-hand-side above must equal 

/ ^Mei ° T^ei I T] • TT /g(e^) O TTgfe^) dfl. 

Finally, by Lemma 3.11 the conditional expectation E^[/ei o tTej | T] is ac- 
tually measurable with respect to T^^ for 6*3 := 52 \ (i?i U i?2 U • • • U Rm U 
{wi,?i'2, • ■ • TWq})^ and is therefore r^-invariant; thus, re-applying g we may re- 
turn the vertices Wj to their original locations Zj, and so deduce that 

/ n •^^ ° '^'^ ^ / ^^[■^'=1 ° '^ei I T] • /e^ O TTe^ d/i. 

"'^ eGF p=2 

This completes the proof. □ 
Letting 

Pk : (A'o X Yo) X (A'l x ^1)'= x • • • x {Ku-i x Yk^i)i^-^) ^ 

be the distribution of any arbitrarily chosen tTe for e e ('^^j (by exchangeability 
the choice of e will not matter), and writing Z[ for Ki x Yi when i < fc — 1 and 
for ATfc alone when i ~ k, the above relative independence immediately gives the 
following. 

Proposition 3.13. If K is a k-palette and 1^1 is an exchangeable random K- 
coloured hypergraph then we can introduce new standard Borel spaces Zq, Z(, 
forming a k-palette Z' , Borel maps ki : Z[ — > Ki with Kk = id/ffc, 
an exchangeable random Z' -coloured rank-{k — 1) hypergraph jik-i coloured by 
them, and a Syu\[[k])- symmetric probability kernel 

Pk-.Z'^x {Z[f x • • • X {Zl_^)i^-^) ^ Kk 

such that 



□ 



We can now prove Theorem 2.9. 



Proof of Theorem 2.9. Starting from /x, we may apply Theorem 3.13 to obtain 
spaces for < i < A: — 1 enlarging the colour spaces Ki, together with a 
probability kernel Pk as in that theorem. This gives the desired structure at 
the 'top level' of fi. However, having obtained the new exchangeable random 
.Z'-coloured hypergraph fj,' with rank at most k — 1 from which fi can thus be 
extracted, we may iterate the conclusion of Theorem 3.13 to obtain greater and 
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greater enlargements Z[ ^ zf^ *^ , and at the i*'^ step also an 

additional probability kernel 

so that if once this iteration is complete we take Zi :— zf~^^ for each i <k—l 

and Pfc_i+i to be Plf!_^_^i composed with the factors Zj zj*-* for j < k — i, 
then these ingredients recover the full structure that wc want. □ 

As in Subsection 3.2, let us note that the form of our Structure Theorem 2.9 
is somewhat different from that following directly from the probabilistic results 
of Kallenberg [44]. We shall address this issue in more detail in Subsection 3.7, 
when we discuss the setting of colourings of partite graphs, since it is actually 
these that correspond most closely to the original probabilistic results; here let 
us simply state the extension to arbitrary-A: case of the older formalism: 

Theorem 3.14. For any k-uniform exchangeable random K -coloured hyper- 
graph /i there is some measurable function f : [0,1](''') K such that, if 
(^a)^gj- s -J is a collection of independent uniform [0, \]-valued random variables 

then the stochastic process {f{S.a)aCe)^^rs\ has joint law /i. 



3.4- Finer topological consequences of the structure theorem 

In later subsections we will describe modifications of Theorem 2.9 to a number of 
related notions of exchangeability. However, before doing so we wish to elaborate 
some more delicate topological features of the proof of the last section, in the 
case when the original palette K consists of finite sets. 

Our interest is in a sense in which our extraction of suitable structural ingre- 
dients for a given /x was 'continuous' in that original measure /x. This aspect to 
these theorems, while an immediate consequence of the proofs, seems to have 
been ignored in earlier works, since its consequences for probability theory are 
slight. However, they will be crucial for the use to which the structure theorem 
will be put in [10], and so we recount them here, and will restate them when 
we turn to directed hypergraph colourings in the next subsection (again, for the 
sake of applications in [10]). 

Let us re-describe the proof of the previous section in slightly different terms. 
In order to deduce the structure of the exchangeable law fj,, we first identify S 
with an infinite-co-infinite subset Si of itself through some bijection ip : S ^ Si, 
and then agree that by exchangeability it suffices to understand the structure of 
the projection (7r^si -j)^/i. We then make use of certain tr-algebras generated by 

the colours tte of the spare edges in (^) \ ('^^) to describe that structure, enough 
to justify the recursion clause of an inductive argument. 

If we unravel the induction, we may write instead that we have chosen a 
partition S'2,1 U 6*2,2 U . • • S2,k of S'\ Si into infinite subsets, and now the estimate 
of Proposition 3.12 shows that 
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• defining ttu''^^ to be the cluster of eolours 

(7re')e'CSiUS2,iU---US2,fc, e'n(SiUS2,iU---US2,fc-i)C'u 

for each u G (^■5iuS2,iu^-uS2,fc-i~j ^taking values in a space Zk-i given by 
the obvious product of copies of A'^'s), the highest-rank colours tt^ for 
e G (^1^) are relatively independent over {ttu''^^)^^^ (we defined ttu'''^ 
also for u in the larger collection (^'^i^-^2,iu--uo2,fc-ij j^^. ^uxihary use m 
subsequent steps); 

• next, forgetting about the ATfc-colours of any fc-subsets not contained in 
Si, and defining Hv''^^ to be the cluster of colours 

(^u'*' ^)ti'CSiUS2,iU---US2,fc-i, ti'n(SiUS2,iU---US2,fc-2)C'u 

for each v G (^•5'iuS2,iUj^uS2,fc-2~j (^taking values in new larger product space 
Zk-2 similarly to above), now the colours ttu^^^ for u G (j,'^\) are relatively 
independent over (tt^ '°~^),ygC si \ \ 



Simply by keeping track of which clusters of colours of subsets of U 6*2,1 U 
• ■ • U S2.k end up inside each of the spaces Zi during the above construction, and 
which are simply thrown away, we find that we have proved a 'continuous and 
functorial' version of the structure theorem in case K is finite (in fact, in case 
K is itself compact, metrizablc and totally disconnected). This is more easily 
stated with help from a little additional notation. 

Definition 3.15 (Hypergraph powers of maps). Suppose that Y = {Yi)^^^ and 
Z = {Zi)^^Q are fc-palettes, that A is a sequence of Borel maps : 1^ ^ Zi and 
that S is some vertex set. Then by the hypergraph power of A over S we 

understand the map A(S) : r(^) Z(^) defined by 

A^^^(y0, (yOies, ■ • ■ , (ye)eg(S)) (Ao(y0), (Ai(y,)),gs, . . . , (Afc(ye))^g(sp. 

It is clear that A^'^^ is continuous if each Y,; and Zi has a topology and 
Aj is continuous. We will use these hypergraph powers through the following 
construction. 

Definition 3.16 (Internalizing vertices). If 7^ is a fc- palette and /i is a random 
-fC-coloured hypergraph on S'iUS'2 for two disjoint infinite sets 5*1 and 5*2, then we 
can define the S'2-internalization of K to be the fc-palette K'^^^ := {Kf^^)'^^^ 
given by 

„ ( ( ^2 \ 
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and the S'2-internalization of /i to be the /v ^'^^-coloured hypergraph /x*'^^ on 
5*1 obtained by the canonical identification of k'^^^'^^^^ with 

X (A'f'-''' X A'flf X • • • X A'l^ X A'l)^' X • • • X K^^*-^ 
that results from partitioning 




and identifying of {u e : |u n S*!] = i} with (^/) x (^-O. 

Now the re-write of Theorem 2.9 to which we have been building is the 
following. 

Theorem 3.17. Suppose that K is a finite k -palette, fi is an exchangeable 
random K -coloured hypergraph on Si, S2 is some additional countably infinite 
pool of vertices and is the unique exchangeable extension of fi to a random 
K -coloured hypergraph on SiU S2- Then there are a k-palette Z = {Zi)'^^Q com- 
prising totally disconnected compact metric spaces (which are therefore home- 
omorphic to closed subsets of the Cantor space) and collections of continuous 
maps Ki : Zi —I- Ki and A.^ : Kf^'^ Zi such that under A^^-'(/i')*"^2 the ran- 
dom variables for \e\ ~ i are relatively independent when conditioned on all 
the random variables 7r„ with \u\ < i, and fi = k^^^A^^'' (^')*'^^ . □ 

It is this version of our result, together with its directed- hypergraph counter- 
part, that will underpin the later sections of [10]. (It will be formulated there 
using slightly more category-theoretic terms, as these will help with the organi- 
zation of that paper, but the content is easily seen to be identical.) 

3.5. Exchangeable random directed hypergraph colourings 

In the previous section we considered probability measures on the space 

Y[i<k indexed by the collection of subsets of S of size at most k; a natural 

extension of this setting is to work instead in the space ni<A: i^T^^"-'^'*'''^'' indexed 
by the sets Inj([i], S) of all ordered i-tuples with distinct terms in S for i < k: 
that is, of iiT-coloured directed hypergraphs on S. Let us first re-assign our no- 
tation from Subsection 3.3 in this new setting, now writing A'^'^^ for this space 
Y[i<k kI'^^^^^^'^\ Then we can make the obvious analog of Definition 3.7. 

Definition 3.18 (Exchangeable random coloured directed hypergraphs). Given 
the same data as above, by an random A^-coloured directed hypergraph 

on S we understand a Borel probability measure fi on A"*-^^ that is invariant 
under the coordinate-permuting action t of SymQ(S') defined by 
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In order to handle this new setup efficiently, we turn to the partitions 
Inj([i],S') := IJgg^s-j Inj([i], e). For each i < k we are considering a colouring 

from some fixed standard Borel spaces Ki of directed hyperedges (j) G Inj ( [i] , S) . 
This can be viewed as assigning to every undirected i-hyperedge e a colour from 
the larger space kI'^^^^^^''^\ with one factor oi Ki appearing for each of the il pos- 
sible enumerations of e, although there is no canonical way to choose a bijection 
between the enumerations of one i-hyperedge e and of another i-hyperedge e'. 

We will find that our structure theorem for random directed hypergraph 
colourings requires us to work with these clusters of colours indexed by the 
corresponding undirected edges. This will lead to an important difference from 
Theorem 2.9. In the undirected setting, at the i*'^ step of our recipe we choose the 
i-hyperedge colours independently at random from the distributions given by the 
kernels Pi depending on the previously-chosen lower-rank hyperedge colours. For 
directed hypergraphs, however, the choices of colours of directed i-hyperedges </> 
and "0 with different images are still independent, but for (f> and ^ two different 
injections from [i] into the same undirected i-hyperedge a G ('^) they are usually 
not. Rather, their dependence is coded into the probability kernel Pi, and is quite 
arbitrary subject only to the Sym([i])-covariance condition on Pi. 

We cannot hope for these last to be independent because the distinct ran- 
dom variables indexed by different enumerations of a single i-set a are always 
moved around together under vertex-permutations, and so, aside from moving 
the vertices around inside a, no averaging over different images and subsequent 
estimates on covariances can be used to pry apart their joint distribution. 

Nevertheless, this slight additional complexity having been taken into ac- 
count, the basic methods of the preceding sections give an analogous result to 
Theorem 2.9. We shall give the details of this statement here, but leave the 
(essentially unchanged) proof to the reader. 

Definition 3.19 (Directed ingredients). By a sequence of ingredients we un- 
derstand a sequence Zq, Zi, . . . , Zk-i of standard Borel spaces, a probability 
measure fio on Zq and (setting Zk '■= Kk) Borel maps Ki : Zi Ki for each 
j = 0,l,...,fc— 1 and probability kernels 

for each i = 1,2, . . . , k that are covariant under the natural coordinate-permuting 
actions of Sym([i]) on the domain and on the target. 

Definition 3.20 (Directed recipe). Given a sequence of ingredients 

(Zq, Kq, ^o), iZl,Ki,Pi), . . . , {Zk~l, Kk-1, Pk-l), {Kk,Pk), 

by the standard recipe we understand the following procedure for picking a 
random member of {i^4>) 4,e[j^^^inm,S) ^ 

• Choose Z0 G Zq according to /io, and set := ko(z0); 

• Colour each vertex s £ S hy Zg £ Zi independently according to Pi (zg, . ), 
and set uj^ ki(zs); 



T. Austin/On exchangeable random variables 



113 



• Colour each a ~ {s,t} e (2) by some pair (20)0ginj([2],a) G ^^"j([2],a) 
chosen independently according to P2{z(/,, Zg, zt, •), and set K2{z,p) 
for each (f> G Inj([2],S'); 

• Colour each u G (^f J by some (fc-l)!-tuple (-Z0)0Ginj([fe-i],u) in ^^'^^"^1^") 
chosen independently according to Pfc-i(-20, (zs)s6u, ■ ■ • , (■z»/.)veinj([fc-2],«), ■ ^ 
and set uj^ := Kfc_i(z0) for each G Inj([fc — 1], S); 

• Colour each e G (^) by some fc!-tuple (ijJ0)0einj([fe],e) G /-^^^"■'(['^l'^) indepen- 
dently according to Pk{ziD, (zs)see, ■ • ■ , (^i/;)^Ginj([fc-i],e), • )• 

If fi is the law of the resulting random A'-coloured fc-uniform hypergraph, we 
will say that the ingredients yield fi upon following the standard recipe. 

Theorem 3.21 (Structure theorem for exchangeable random directed hyper- 
graph colourings). For any exchangeable random K-coloured directed hyper- 
graph fj, there is some sequence of ingredients (Zq^kq^^q), 

Pfc_i), {Kk,Pk) which yields fi upon following the standard recipe. 

As in the case of undirected graphs, our application of this result in [10] will 
require the following 'continuous and functorial' version, which follows from the 
above argument just as did Theorem 3.17 from the argument of Subsection 3.3. 
We first make an analogous directed version of Definition 3.16; we will use the 
same notation, since no confusion should arise and this definition collapses to 
its predecessor if we identify undirected graphs as a special class of directed 
graphs. We will also use hypergraph powers of maps as in Definition 3.15, but 
this definition is essentially unchanged. 

Definition 3.22 (Internalizing vertices). If is a fc-palette and is a random 
A'-coloured directed hypergraph on Si U ^2 for two disjoint infinite sets and 
5*2, then we define the S'2-internalization of K to be the fc-palette K'^^^ := 
(^r')io given by 

z^-aS, .„ (l^J)xInj([/c-^].S.) (['=,-il)xInj([fe-.-l],S.) 

and the 52-internalization of fi to be the A"'*''^^ -coloured directed hypergraph 
^tMS2 obtained by the canonical identification of A'^'^^^'^^' with 

i=0 

that results from partitioning 

k—i 

|Jlnj(W,^iU52) = U U {0Glnj([i+j],^iU52) : Mt + j]) n Si\ ^ i} 

i<k i<k j—0 

and identifying {0 G Inj([i + j], S*! U S'2) : \(l}{[i+j])nSi \ = i} with Inj([i],5'i) x 
(''uf') ^ I^j(b]' "^2) in the natural way. 
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Theorem 3.23. Suppose that K is a finite k-palette, fi is an exchangeable 
random K -coloured directed hypergraph on Si, S2 is some additional countably 
infinite pool of vertices and ^' is the unique exchangeable extension of fi to a 
random K -coloured directed hypergraph on 6*1 U 5*2. Then there are a k-palette 
Q comprising totally disconnected compact metric spaces and collec- 
tions of continuous maps Ki : Zi — > Ki and Ai : Kf^^ ^Sym([2]) ^^^^ ^^^^ 
under A''^^\fi')'^^^ the clusters of random variables (7r0)^gi„jQj] „) for distinct 
u e ('^■^) are relatively independent when conditioned on all the random variables 
TT^ with i' e Inj(< i, Si), and ^ = kJ^^'^A^^'' (^')"^' • □ 

Before leaving this subsection, let us mention the following possible common 
generalization of the settings of undirected and directed exchangeable random 
coloured hypergraphs. Let us consider again only the /c-uniform case, as this 
illustrates all the new ideas. We now posit a whole family of colour spaces 
(i<re)eg(S) indexed by the hyperedges, together with an action g ^ of Symg 
on the space IleG(^) that involves both the composition with the usual action 

of g on the base space ('^) through permuting 5*, and also some nontrivial 'local' 
transformation by g of the value of tTe for an individual hyperedge e: precisely, 
r : SymQ(5') r\ neG(-^) is of the form 

T»((^e),g(S)) = «K(e))),g(S), 

where for each g e SymQ(5) and e e (^) the map erf is a Borel isomorphism 
Xg(e) — > Kf. which depends only on the restriction g|e and is subject to the 

'cocycle condition' that erf o cr^^g-, = erf °^ for all g, g' (so that we obtain a 
well-defined group action overall). 

Thus the map af provides a local transformation from the individual fibre 
i^g(e) to Ke, but different g with the same image g{e) may give different such 
transformations. In particular, restricting to those g with g{e) = e, we obtain 
a possibly non-trivial Borel action of Sym(e) on for each e. (We warn the 
reader that it cannot in general be reconstructed uniquely from the knowledge 
of only the action of Sym(e) on given by those erf with g{e) = e.) Given an 
action r of this form involving nontrivial local transformations, we can study 
those probability measures pi on nee(^) that are r-invariant. 

It is straightforward to check that the basic method we have outlined above 
can be adapted to give a structure theorem for such invariant measures simply 
upon making all of our ingredients suitably covariant under the local transfor- 
mations fff . From this we can recover Theorem 2.9 by taking K,, := K and each 
erf to be the identity, and Theorem 3.21 by taking := K^'^M^^'^) and erf to 
be the natural coordinate-permuting map 

('^0)06lnj([fc],e) ^ ('^0og)0elnj([fc],e) 

whenever g{e) = e. In this second setting we see the clusters of colours indexed 
by all directed edges with a given underlying undirected emerge again, just as 
in the earlier arguments of the present subsection. 
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3.6. Two counterexamples 

The need for several type spaces 

We will now give the first of the counterexamples promised in the last part 
of Subsection 2.2. We show that for simple exchangeable random hypergraph 
colourings the Structure Theorem 2.9 really does need all of the intermediate 
type spaces Zc it cannot be simplified in general to a construction requiring 
only sampling from some space Zi at rank 1. 

We will give an example of an exchangeable random 3-uniform hypergraph 
colouring that cannot be constructed as a mixture of vertex-sampling random 
3-uniform hypergraph colourings; it is easy to construct similar higher-rank 
examples. First choose for each e £ (2) independently and uniformly a random 
colour from the set {red, blue}. Now for each u G (3) let a;„ = 1 if at least one of 
the 2-subsets of u is coloured red; otherwise set lOu = 0. Our example will be the 
law of the resulting random 3-uniforni {0, l}-colourcd hypergraph {u)u)^^(^sy 

An easy check shows that this ^ is SymQ(S')-ergodic, so cannot be a nontrivial 
mixture of other exchangeable random hypergraphs. We will show that it also 
cannot be obtained by sampling vertex colours at random from some probability 
space (y, v) and then applying some symmetric probability kernel P : ^ 
{0, 1} (that is, we cannot jump over the intermediate step of randomly colouring 
2-edges in our recipe). Indeed, suppose that /x' is any exchangeable random 
hypergraph that can be constructed in this way, and consider, as in the main 
argument, a partition 5 = 6*1 U 6*2 of S* into two disjoint infinite subsets. For 
each s S we introduce the cr-algebra T'(s) :— o'((7r„)^gj-{3}us2-j), and can now 

compute explicitly from the assumed vertex-sampling-only structure of /i' that 
under ^' the 3-hyperedge random variables 7r„ for u G (3) must be relatively 
independent over the cr-algebra T' := VsgSi ^'^^)- other hand, a quick 

calculation shows that if ui, 7/2 € (3) share exactly one 2-edge e, then under the 



law n the joint behaviour of 7r„i and 7r„2 is already independent from the whole 
of T', but 



This contradicts the abovemcntioned relative independence over T' if /i = ^' , 
and so completes our argument. 

It is worth mentioning the similarity between the necessity of introducing a 
whole list of auxiliary spaces Zi and a general phenomenon surrounding versions 
of the regularity lemma for higher-rank hypergraphs: in the case of the latter, 
in order to yield effective counting lemmas they all require the introduction 
of a whole hierarchy of vertex-partitions, each dedicated to the control of a 
particular size of subsets of the vertex set V up to size k. We will not examine 
this similarity in more detail here, but refer the reader to Gowers' discussion 
in [38, 36]. < 
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The need for unrecoverable quasifactors 

Intuitively, one might wish to do a Httle more than the proof we have given of 

Theorem 2.9: to wit, obtain the spaces zf^^"^ for our Hst of ingredients as factors 
of the original hypergraph colouring probability space {K^^\ n) (we return to 
the convention of Subsection 3.3 for the notation ^K^^^^), without the trick 
of passing to an induced random coloured subhypergraph on K^^^\ In case 
k = 2 this would amount to showing that we can find an S'-indexed family of 
measurable maps (/>s : K^^'' Z\ having the following two properties: 

• covariance: (/>s o = 0g(s) for every g S Symp; 

• the edge random variables tTe are relatively independent over the combined 
map (0s)sgs- 

We will now show that this stronger conclusion can fail (and similarly in case 
fc > 2), and so some trick like the passage to induced subgraphs followed by 
structural pull-back is really necessary. We will give an example with fc = 2 in 
which, intuitively, while no one vertex of S'2 is important for the construction 
of the auxiliary spaces Z\ , we cannot do away with all vertices of S'2 without 
losing relevant information. 

We first derive some consequences for a family (0s) 3^5 of measurable maps 
of the first of the above conditions in case /i is an exchangeable random K- 
coloured graph (for a single colour-space K, not a more elaborate 2-palette); we 
will then show for our example ^ that we cannot find a family that also satisfies 
the second condition. 

For such [I, the cr-subalgebra (/)J^(E2j C ™ust be contained in the 

^-completion of the cr-subalgebra Ss C S of all those A G E that are invariant 
under Stabsym„(s) := {t^ : g{s) = s}: indeed, by the covariance of these maps 
if i? S and g(s) = s then 

We will use a certain approximability result for this cr-algebra . Since /i is a 
Radon measure, for any ^ G Ss and any e > we may find some finite / C S'\{s} 
and some finite-dimensional subset B C {0, 1}*^ = such that ii{AAB) < e, 
and so — < £• However, since A is fixed by Stabsymo(s), this gives 

also ||1a — Is o 'i'^\\L^{fi) for every g e Stabgym^ (s), and hence 



1 



geF 



< e 

L1(m) 



for all finite F C Stabsymo (s). Since this group (being isomorphic to Symp, for 
example) is amenable, by the ergodic theorem for amenable groups we may take 
a pointwise and L-'^-limit of averages over the members of some F0lner sequence 
for Stabsymo(s), and so obtain from 1^ some function / that is Ss-measurable 
and such that ||1^ — fWii^p) < e. Hence, since e was arbitrary, we have proved 
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that any A e hcs in the /i-completion of the a-subalgebra Too(s) Q S 
generated by functions, such as /, obtained as Stabsymo (s)-ergodic averages of 
finite-dimensional functions. Moreover, since 1^ may be written as a finite sum 

/{s}U/\ 

of indicator functions /{^ju^n =r;} for various rj £ {0, 1}*^ ^ we may further 

reduce to considering B of this particularly simple form. 

We are now ready for our example. Consider the exchangeable random ({0, 1}- 
coloured) hypergraph obtained from ingredients Zq := {*}, Zi := {red, blue}, 
and with conditional edge-colour distributions ^2(^1, Z2, ■) given by 

P2{Z1,Z2,{^}) = 1 - ^zuZ2 ■ 

that is, we sample a family {zs)ses of colours, red or blue, independently and 
fairly at random, and then insert an edge between s,< e 5* if and only if s and 
t are different colours. This is simply a random complete bipartite graph. 

We will show that for this /i the a-subalgebras Eg of E are 'too small' to 
allow the required conditional independence of the edge random variables TTg. 

/'{s}U/\ 

Consider a finite-dimensional cylinder set B C {0, 1}\ ^ ) corresponding 

to some rj € {0,1}^ 2 ) as above. Take the F0lner sequence for Stabsymo(s) 
comprising the family Fn '■— Sym(J„) for some increasing sequence Ji C J2 C 
J3 C . . . of finite sets with union S* \ {s}. Once n is large enough we must have 
J C J„, and then for lu £ X the function 

simply counts the number of isomorphic copies of the graph 77 that occur in 
w|^.7„^ if we insist that s does not move but choose an arbitrary size-|/| subset 

of Jn for the remaining vertices. 

For n very large, this statistic cannot distinguish between the two possible 
vertex colours, red or blue, since there is an automorphism of (Zi, P2) that swaps 
them: if we construct the probability measure on the whole of {0, X Z? 
that arises from our standard recipe, then conditional on either of the events 
's is coloured red' and 's is coloured blue', our above ergodic averages converge 
to the same (//-almost surely constant) value. Heuristically, for any vertex, an 
observer situated at that vertex and able to observe its connectivity to all the 
other vertices and the connectivity among those other vertices can accurately 
discern the two infinite clusters of the bipartition of the graph, but has no sta- 
tistical way to tell which is red and which blue, and so cannot tell the colour of 
any given vertex, including their own. More formally, since the above average 
converges to a constant for any such 77, we conclude that any ergodic-average 
function such as / above is /i-almost surely constant, and so the measure-algebra 
of {K^^\Es, fj-lsj is trivial for any s € S. It follows that for e = {s, t} the con- 
ditional distribution of TTg relative to V must be //-almost surely constant; 
since, on the other hand, the edge random variables tTe for different e are not 
independent under /t, it follows that our putative factor maps 4>s cannot have 
the second of the properties listed above. < 
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3.7. Partite hypergraphs and Gowers norms 

Let Si, S2, ■ ■ ■ , Sd be a finite list of disjoint countably infinite vertex set and 
k < d. We will now briefly treat random colourings of the complete A:-uniform 
hypergraph on the vertex classes Si, S2, ■ ■ ■ , Sd (restricting here to the case of 
a fixed rank k for simplicity). For each e € ('j^') we refer to the points of the 
product space Yiiee '^i hyperedges above e. Let us introduce the nota- 

tion (^SuS2,S3.---,Sd-^ ^Yic set IJee(M) Uiee fc-hyperedges of the com- 

plete such partite hypergraph. Suppose that is some standard Borel space 
for each e e ('^1), and consider a (i<re)gg^[di-) -coloured exchangeable d-partite 
fc-uniform hypergraph fi on these vertex sets: that is, a probability measure 

i-r ^' 

on the product space Oee(i'^i) that is invariant under the separate 

coordinate-permutation action of each SymQ(5'i) on Si. 

In this case one can prove another structure theorem along just the same lines 
as those reviewed above. It tells us that we may always build up an exchangeable 
random d-partite fc-uniform i^-colouring fi according to the following recipe: 

Ingredients: standard Borel spaces Zi, indexed by all b G [^j.) with Za = 
Ka when \a\ = fc; a probability measure fiQ on Z0; and probability kernels 
Pc ■■ ribed^it^) Zc for aU nonempty c G (l^'). 

Standard recipe: 

• Choose Z0 G Zijf at random from Z^ according to (Iq; 

• Choose Zg G Zi for each i G [d] and s G Si independently with law 

PziziD, ■); 

• Choose Zu G Zb for each b = {i,j} G ('2') and u — (si,S2) ^ Si x Sj 
independently with law Pb{zijj, z^^, z^^ • ); 



• Choose G Ke for each e G ('^') and w G Hiee independently with 
lawPe((z.„|J6e(<,:=_,)' ■)■ 

We have assumed no symmetry at all between distinct Si in the above de- 
scription, but it is not hard to see that such additional symmetry would be 
reflected in various further finitary symmetry constraints on the kernels Pa- 
Our main result is now the following. 

Theorem 3.24 (Structure theorem for exchangeable random partite hyper- 
graph colourings). For any exchangeable random K-coloured k-uniform hyper- 
graph there is some collection of ingredients {Zb),^([d]\, /io and (Pc) ^([d]\ which 

yields fi upon following the standard recipe. 

The proof that the above recipe can describe all exchangeable partite hy- 
pergraph colourings is essentially analogous to that for non-partite hypergraphs 
given previously, and we shall not spell out the details. In fact, the foundational 
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work by probabilists on exchangeability corresponds most closely to this setting 
when d = fc: in particular, exchangeable partite graphs with d ^ k = 2 first ap- 
peared in the probabilistic literature under the titles 'separately-exchangeable 
arrays' or 'row-and-column exchangeable arrays'. See, for example, the discus- 
sions in Section 14 of Aldous [3] and the introduction to Kallenberg [44]. 

These older theorems are not stated in quite the same formalism as that we 
have adopted in this paper, so let us briefly compare them. We shall take Aldous' 
treatment of the case d ~ k ~ 2 in Theorem 14.11 of [3] as representative of the 
traditional probabilistic approach; the presentation used by Kallenberg in [44] 
is very similar. Assume that k ~ d and Si = S2 ~ ■ ■ ■ ~ Sd and write K for 
Ke- Observe that if {S,b.w)bc[d].wes'' is a-ny collection of independent uniform 

( '''' ~l 

[0, l]-valued random variables and / : [0, 1]^^''' K is any fixed measurable 
function, then the stochastic process {tTw)w<£S'' ■= (/((Cb,(toi)ig(,)&Ce))^,g5d has 
law an exchangeable iT-colouring of S^; in this case we write that / represents 
the law of the process tt. The classical form of the structure theorem is now the 
following: 

Theorem 3.25. Any exchangeable random K-coloured d-uniform d-partite hy- 
pergraph fi is represented by some f. 

The difference between this result and Theorem 3.24 above can be seen as 
one of the choice of representing ingredients; however, it is not too difficult to 
move between them. 

On the one hand, if /.( is representable, then given / we may set Pe to be 

its conditional distribution on the factor [0, l]^^''-!' ^ [0,1]^'*'', and now set 
Zb [0,1] when |6| < fc — 1 and fiQ and Pc{z, •) to be Lebesgue measure 
for all z e [0,1] (--1) whenever \c\ < — 1; it now follows routinely that these 
ingredients yield precisely the joint distribution of the process (Tr^)^,^^^ upon 
following the standard recipe, and so we have deduced Theorem 3.24 for this 
instance of a representable process. 

In order to prove the reverse implication, it is necessary to process a collec- 
tion of ingredients Zb, jio and Pc by replacing each space Zb with a copy of the 
unit interval [0, 1] and then 'outsourcing' the noise in each probability kernel Pc 
to an independent random variable drawn uniformly from that interval. This is 
also a very standard argument, but we shall defer presenting this method until 
Subsection 4.2, where we shall use it to deduce a recent representation theorem 
of Elek and Szegedy [24] (Theorem 4.1) from our version of the Structure The- 
orem in the non-partite case, having proved our ability to perform this 'noise 
outsourcing' in a suitable form in Lemma 4.2. 

Before we leave this subsection, it is worth noting that it is (fc + l)-partite 
/c-uniform hypcrgraphs, not just ordinary fc-hypergraphs, that emerge naturally 
in the recent hypergraph regularity approaches to Szemeredi's Theorem (see, 
for example, Nagle, Rodl and Schacht [-53], Gowers [36] and Chapter 10 of Tao 
and Vu [04]). 

While the parallels between the structure theory for partite hypergraphs and 
finitary hypergraph regularity lemmas are much as in the non-partite case, in 
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this partite picture it is also quite natural to introduce an infinitary notion of 
the 'uniformity seminorms' that were first used by Gowers ([38, 3G]) on the way 
to his version of hypergraph regularity. 

Definition 3.26 (Gowers norms). If /i is an exchangeable law on 

nee([''i) Ke''"' ' and / e i°°(A'e) for some e e then we define the Gow- 
ers uniformity seminorm of / above e under to be 

II/IIuhm) ■■={ I n Z"^-. 

V-^ r,e{l,2} = 

where we have chosen arbitrarily a collection of pairs & Si for i e e and 

then written := (s^. )ige for S {1,2}^. It is clear that by exchangeability, 
any choice of distinct s\,s\ E Si for each i E e will give the same value. 

By mimicking the analysis in the finitary setting, it is easy to show that || • 
||u<!(p) is a seminorm. Given some complicated exchangeable partite {Ke)^^^id]y 

colouring /i, it is clear that generically all functions / S L°°{Ke) have ||/||u=(/i) > 
0; however, after implementing the structure theorem we obtain an extended 
partite non-uniform exchangeable random hypergraph colouring fl of 
[jj<k{^^'^^j"'^'') by some auxiliary palette {Za)aC[k] with Ke when 

|e| = fc, and it can be shown that on this enlarged product space the func- 
tions / G L°"{Za) for which ||/i|u°(/i) = are precisely those such that f o tt^ 
is /i-almost measurable with respect to (Tr^i'^)^^^^ a y for any choice of u; S 

riiea'^i- This closely parallels Lemma 4.3 in Host and Kra's use in [43] of an 
infinitary analog of the related arithmetic Gowers norms (see [37, 64]), and the 
proof is exactly similar. 

3.8. Models of simple theories 

Our structural results applied to hypcrgraphs, directed hypcrgraphs and tower- 
graphs can be embedded into a somewhat more general setting that has already 
emerged to a similar purpose in recent work of Razborov [-54] (to which we shall 
return in Subsection 4.3 below). We shall assume various definitions from model 
theory; see, for example. Chapter 1 of Kopperman [49]. Let T be a universal 
first-order theory with equality in a language £ that contains only predicate 
symbols. Let us suppose first that these symbols have arity at most some finite 
fc > 1, and (for convenience) assume that T has only a countable set S of such 
predicate symbols; assume further that T has infinite models. For each i < k 
let Si 'E S contain those symbols of arity i, and let Ki be the space {0, l}"^' 
with its product topology and Borel cr-algebra; points of Ki are to be regarded 
as truth-assignments to the predicates of Si . 

We should stress that we have slipped into the rather abstract lexicon of 
model theory for its convenience; for theories T as above, our guiding intuitions 
will remain those of measures with certain symmetries on a Cantor space. 
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If the theory T is free then its models with underlying vertex set N are 
precisely the maximal-rank-fc directed hypergraph colourings over N coloured 
by Kq, Ki, . . . , Kk^ except that now we must also allow 'loops': a tuple (xi, X2, 
. . . ,Xi) G W in which some coordinate appears more than once can also be an 
argument for an arity-z symbol. Thus in the free case our space of models over 
the vertex set N can be identified as X := Y[i<k ■ 

If T is not free (but still does admit an infinite model), we must correspond- 
ingly restrict to the subset Xj- of X := Y[i<k -^'^^ containing those points that 
are models of T. This is a closed, hence compact, subset of X, since any individ- 
ual interpretation of a sentence in T over some particular finite set of vertices in 
N simply carves out some clopen subset of X depending only on those vertices as 
coordinates; the existence of an infinite model is equivalent to the non-emptiness 
of the intersection of these clopen subsets. The resulting closed subset Xt is 
invariant under coordinate-permutation. 

Given any such theory T, we can consider the compact convex set Qt of 
Radon probability measures on Xq- invariant under the obvious SymQ(N)-action, 
with its vague topology, and ask for a description of the structure of these mea- 
sures. In fact, such measures have a long history in the model theoretic literature: 
see, in particular, the papers of Gaifman [.U] and Krauss [ol], and also the dis- 
cussion of these actions of Symg(N) as the 'logic actions' (although without the 
introduction of invariant measures) in Section 2.5 of Becker and Kechris [11]. Of 
course, we may identify these invariant measures as the exchangeable random 
hypergraph iC-colourings that are supported on the closed subset Xr of X, and 
so we do at least know that they can be described by the standard recipe, but 
now the additional constraints imposed by T translate (at least in principle) 
into additional 'fine-tuning' conditions on the ingredients. Various more precise 
questions may now be posed about these. For example: 

Question 3.27. Given a theory T having only function symbols of rank 2, when 
is it the case that any (say, ergodic) ^ with support in Xq- can he represented 
using ingredients P2 : j-{Sym{l2]) ^^^^ themselves be taken to be de- 

terministic maps, and hence correspond to measurable models of T with vertex 
set equal to some fixed copy of the spaces Z\ ? What happens if the rank is 3 or 
greater? 

It follows from results of Fremlin [28] that equivalence relations (which fit 
into the above picture with rank 2) do behave in this way, and a positive answer 
to Fremlin's Problem FY ([27]) would show the same for partial orders. On 
the other hand, in the free case of graphs and hypergraphs the need for non- 
deterministic probability kernels Pj in most cases is clear, and so these cannot 
satisfy the above condition. 

A different class of questions pertaining to essentially the same formalism as 
above are those around the testing of hereditary properties of coloured hyper- 
graph, to which we will return in Subsection 4.5, and in more detail in [lU]. 
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3.9. A weakened hypothesis: spreadability 

We will finish our review of the classical probabilistic theory by considering 
another direction in which many of the above forms of exchangeability can be 
weakened. We now suppose that T is a (necessarily infinite) index set and that 
r is a semigroup of self- injections of T (crucially, which may not be invertible), 
and in this context write that the law fi of the canonical process (jTt)teT is 
F-spreadable if {T^g[t))teT still has joint law /x for any 5 G F. As in the ex- 
changeable case, it turns out that if F is a sufficiently rich class of self-injections 
then these spreadablc laws ^ must still take quite a precise form (and, indeed, 
very often the resulting structure theorem for a spreadability context subsumes 
some result for a related exchangeability context on the same index set). 

Our leading examples of spreadability, as of exchangeability, correspond to 
spaces of hypergraph colourings over some countably infinite vertex set 5, but 
now with the additional data of a fixed total order < on S* and the requirement 
that our law jj, be invariant under the semigroup F of order-preserving self- 
injections of S. In the special case fc = 1, the spreadable generalization of de 
Finetti's Theorem was proved by Ryll-Nardzewski in [58]; the results for higher 
ranks k were then settled by Kallenberg in [44]. We shall discuss the methods 
needed for these structural results only cursorily here, referring the reader to 
this last paper for a complete account. We note that spreadability is referred 
to as 'spreading-invariance' (in the special case fc = 1) in Kingman [48] and as 
'contractibility' in Kallenberg's more recent book [40] . 

In the context of such hypergraph spreadable laws, the use of auxiliary ver- 
tices often requires the a priori observation that we have some freedom to choose 
the countable total order (S", <) (up to order-isomorphism), since our law on, 

say, K^'^'^ will always be determined by its finite-dimensional marginals, and 
so each spreadable law for any one countably infinite choice [S, <) determines 
uniquely such a law for any other such choice. Often the proof of the relevant 
structure theorem proceeds much more smoothly provided wc begin with a suf- 
ficiently rich total order (5, <), in order to provide enough 'reference vertices' 
for our subsequent argument. 

So wc still use the trick of splitting T into a subfamily Ti and an auxiliary 
collection of 'reference indices', but this now takes typically a much more elab- 
orate form. For example, for Lemma 4.4 of [44], providing a structure theorem 
for spreadable (^^) -indexed processes, Kallenberg must embed N as a subset of 
the much more complicated total order on Q, and prove (by a straightforward 
appeal to the DanicU-Kolmogorov extension theorem) that any spreadable {^j})- 

indexed process can be correspondingly embedded in a spreadable -indexed 
process. 

The above analysis can also be pushed a little further, by introducing canon- 
ical processes (TrQ,)^^^ h -j indexed by all finite subsets of N and with values in 

some infinite palette [Kq^Ki^ . . .), and ask for the consequences of exchange- 
ability for their structure. This study, too, can be carried out for the weak- 
ened notion of spreadability, and an analogous structure theorem is still avail- 
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able: in some sense this is the end-point of the classical examination of ex- 
changeability and spreadability, appearing in Kallenberg [44]. He proves that 
there always exists an infinite auxiliary palette (Zq, Zi, . . .), probability kernels 
Pd '■ Y\ac_[d] ^-"^"^ measurable functions : Zd ^ Kd such the law of 

the canonical process (tTq)^ Fj ^ agrees with that of (^|q| (^a))^,^^ where 
i^a) iz( " \ the set- indexed process with values in the palette {Zq, Zi, . . .) 
and law built from the above kernels following the standard recipe. Aside from 
some finitary symmetry assumptions, the kernels Pd and maps kd are arbitrary, 
and this is a more-or-complete explicit structure theorem. It can be proved (es- 
sentially) by first establishing a version of the structure theorem for (possibly 
non-uniform) hypergraphs of finite rank k and then treating a general spread- 
able set-indexed process as a union of subprocesses corresponding to increasing 
k, so that the final structure emerges from an iterated enlargement of the auxil- 
iary spaces Zd to handle the sub-processes of (TrQ,)^^^ n -j indexed by larger and 

large finite subsets. 

Let us not leave spreadability without also mentioning the analysis of Fremlin 
and Talagrand [29], which appeared independently of those mentioned above. 
Their initial interest is in a class of extremal questions for quite general random 
graphs on N that were first raised by Erdos and Hajnal in [25] (and comparable 
with the questions we shall raise in Subsection 4.6 below). They first reduce to 
the case of spreadable random graphs (under the name 'deletion-invariant' ran- 
dom graphs), and then resolve them by first proving a version of the structure 
theorem and then performing a variational analysis on the ingredients that go 
into it. Their extraction of the structure theorem is rather different from that 
used, for example, by Kallenberg: having descended to a spreadable random 
graph they consider the behaviour of all its individual samples along all non- 
principal ultrafi-lters on N. These together examine the samples of the graph in 
much 'finer detail' than our analysis of conditional expectations, and circum- 
vents the need to embed N into a much larger totally ordered set to provide an 
infinite pool of auxiliary vertices. This has the advantage that the behaviour 
of the graph along any one such ultrafilter must be very simple — everything 
that can converge docs converge — but then the Cech-Stone remainder (3*N 
of N emerges to play the part of our auxiliary space Zi, equipped with some 
Radon measure vi defined from the law of the original random graph, and some 
measurable function h : /3*N x /3*N [0, 1] which (in our terminology) simply 
corresponds to P2 (•,{!})■ Their basic construction is outlined in Subsections 
2A and 2B of [29], and their version of the structure theorem in Section 5. Let 
us note that their statement of the structure theorem in terms of this auxiliary 
function h is arguably closer to the older probabilistic formalism of 'represent- 
ing' an exchangeable law, as we have discussed for Theorems 3.6, 3.14 and 3.25, 
except that Fremlin and Talagrand use (3*N in place of [0, 1], and also allow for 
the construction of different possible probability measures on (3*N. 
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4. Relations to finitary combinatorics 

We now turn to the parallels between exchangeable random colourings and 
the 'statistics' of their counterpart finitary structures, building on the general 
discussion of Subsection 2.3. In the first two subsection we compare exchangeable 
laws with other notions of 'limit object' for sequences of graphs or hypergraphs, 
and then give infinitary reformulations of various questions in property testing 
(Subsection 4.5) and extremal combinatorics (Subsection 4.6). We will finish 
by comparing the theory outlined above with related ideas in ergodic theory 
(Subsection 4.7). 

4.1. The extraction of limit objects 

There are various categories of interest to analysts and geometers in which 
asymptotic information about a sequence of objects can be parceled into a suit- 
able notion of 'limit object'. This limit object may also lie in the original cat- 
egory, or may require an enlargement through a sort of 'compactification' of 
that category. If the objects all exist within some larger ambient space with a 
given topological structure, then this ambient space may itself serve as the larger 
category; however, if no such superstructure is apparent then a more intrinsic 
definition of the limit may be necessary. Sometimes both options are available. 

Let us first discuss this program in some generality. As illustration we will ap- 
peal to two fairly classical examples: the category of metric spaces with Lipshitz 
maps and that of Banach spaces with continuous linear operators. 

Suppose we are working in one of these categories. One option may be to 
find some large ambient space into which each of a sequence of objects can be 
mapped, and then consider convergence in terms of some natural topology on 
the points of that space. Sometimes this choice of ambient space may not be 
canonical, and sometimes it may not be possible at all; however, once it has 
been made then we hope that convergence in that ambient space will be closely 
related to convergence of the parameters or other invariants of the objects that 
were of interest. Of course, we should not expect the resulting limit object in the 
ambient space to retain all of the attributes of a typical term of the sequence. It 
need only distill those properties that are relevant to the convergence, and may 
suppress the others; thus, having identified the possible limit objects, one should 
exercise caution before pursuing an overly-elaborate analogy with the original 
objects. Examples of this construction include Gromov-Hausdorff convergence 
of metric spaces (see, for example, Chapter 3 of [39]) and Banach-Mazur con- 
vergence of Banach spaces of a given dimension (see, for example. Chapter II. E 
of Wojtaszczyk [tio]). 

Alternatively we may have at our disposal a rather more forceful, universal 
construction: that of ultralimits taken along some fixed nonprincipal ultrafilter 
U. Where ultralimits can be constructed at all, they typically do not depend on 
the 'convergence' of the sequence of objects in any sense; rather, they merely 
'ignore' so much of the sequence that what they retain is forced to converge. 
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Ultralimits appear in the same categories as did our previous two examples: in 
metric spaces (Section 3.29 of [-i'*]), where they are used, for example, in the 
construction of asymptotic cones for various classes of metric space (Section 
3.29^); and ultraproducts of Banach spaces have gradually become a standard 
tool in Banach space local theory (see, for example, Appendix F of Benyamini 
and Lindcnstrauss [14]). 

Often both of the above approaches to extracting limit objects are possible. 
They differ in their merits. The first, more hands-on approach requires a careful 
check on whether the sequence of objects converges at all (unless one finds that 
there is some compactness in the ambient object to deduce the existence of 
some suitably convergent subsequence, as happens in the case of exchangeable 
random hypcrgraphs). On the other hand when ultralimits make sense at all 
their definition is typically quite easy; however, the limit objects that result may 
be quite unnatural (ultralimits of separable Banach spaces are typically hugely 
non-separable, for example), and may require further manipulation before they 
really represent the asymptotic data that was sought. Sometimes, this latter 
manipulation then leads naturally back to the same limit object as could have 
been extracted from the first approach; however, in some settings one does 
not need such precise information about the limit object, and in this case the 
ultralimit construction may often be much faster. 

The situation for dense hypergraphs and exchangeable random hypcrgraphs 
seems to be similar. As discussed previously, our construction of sampling ran- 
dom hypergraphs effectively gives an embedding of finite hypergraphs into the 
single ambient space of exchangeable probability measures on {0,1}© for some 
arbitrary countably infinite S*, and we are now able to take limits quite natu- 
rally in the vague topology of measures and identify this with convergence of 
subhypergraph densities for the original finite hypergraphs. Thus, exchangeable 
random hypergraphs can serve as limit objects in the first of our two senses 
above, and with the structure theorem and a little more work it is possible to 
describe asymptotic features of their leading-order statistics in terms of them. 

Indeed, the development of this analogy has an older precedent within com- 
binatorics: that established by the 'objective method' in the study of certain 
enumeration or optimization problems over large finite networks, surveyed, for 
example, in Aldous and Steele [■-]. The basic finitary objects here arc thought of 
as large networks but with (usually) small individual vertex degrees, and often 
endowed with a distinguished vertex called the 'root'. Often one is interested 
in the typical connectivity or other statistics of fixed-radius neighbourhoods 
around a randomly-chosen root, or of flows through the network, and in this 
case the relevant numbers can often be extracted into a suitable 'infinite random 
network' amenable to separate study. We direct the reader also to Benjamini 
and Schramm [12], Aldous and Lyons [4] and Elek [22, 23] for a view of various 
instances of this approach, including connections with the theory of graphings 
of Borel equivalence relations that would take us too far afield here. This ap- 
proach has recently also entered the arena of property testing in a suitable class 
of rooted graphs subject to a fixed maximal degree bound (we will treat prop- 
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erty testing for dense graphs and hypergraphs in Subsection 4.5 below) in works 
of Schramm [ 'i] and Bcnjamini, Shapira and Schramm [13]. 

4-2. Ultralimits and the work of Elek and Szegedy 

While the use of exchangeable laws as limit objects for finite hypergraph colour- 
ings amounts to an embedding of finite hypergraphs into a suitable compact 
metric space, then the approach via ultralimits is also possible. The (easy) ex- 
traction and (harder) structural description of these ultralimits has been carried 
out by Elek and Szegedy in [24] , where they too are able to recover certain 'lead- 
ing order' combinatorial results about finite hypergraphs from general properties 
of the limit object, including a version of the hypergraph removal lemma. 

In fact, Elek and Szegedy's use of ultralimits has a predecessor in Hoover's ap- 
proach to the basic representation theorems for exchangeable arrays of random 
variables, as outlined in [42]. However, Hoover uses the ultralimit construction to 
extract the ingredients for a given exchangeable array, rather than as a possible 
route to defining limit objects for sequences of finite graphs or hypergraphs. 

The analysis of Elek and Szegedy includes a manipulation of the (initially 
huge) non-separable limit-object to give a separable version capturing essen- 
tially the same information; however, the separable limit object that results is 
not immediately equivalent to our structural ingredients for an exchangeable 
random hypergraph. Reformulated as a structural result for exchangeable ran- 
dom hypergraphs, their theorem reads as follows. 

Theorem 4.1 (Elck-Szcgedy Theorem for exchangeable random hypergraphs). 
Suppose that fi is an exchangeable random {0, 1} - coloured k- uniform hypergraph. 
Then there is some measurable W : [0, 1]^ ~^ — s- [0, 1] such that, for any finite- 
dimensional cylinder set A in {0,1}(^) given by 



for some finite k-uniform hypergraph {V, F) with vertex set V d S, we have 



where is Lebesgue measure on [0, 1]UJ. 

(We will not consider here the extension of this result to the case of K- 
coloured hypergraphs, but it docs not seem to offer serious difficulties.) 

We will re-derive this from our structure theorem below, and refer to [24] di- 
rectly for the more combinatorial consequences of their ultralimit construction. 
Interestingly, the cubes with Lebesgue measure [0, 1]2 -1 also arise explicitly in 
Aldous' and Kallcnberg's chosen versions of the representation theorem for ex- 
changeable (and spreadable) arrays or set-indexed processes ([3, 44]), as the tar- 
get spaces of collections of independent uniform [0, l]-valued random variables. 
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As promised during our discussion of their versions in Subsection 3.7 above, 
our deduction of Theorem 4.1 from the structure theorem can be modified di- 
rectly to deduce the equivalence of Corollary 3.5 and Theorem 3.6, Theorems 2.9 
and 3.14 or Theorems 3.24 and 3.25. 

We need to remove the complicated structural ingredients {Zo,fj,o), (Zi,Pi), 
{Zk,Pk) in favour of a single measurable hmction [0,1]^ ~^ [0,1] and 
Lebesgue measure, at the expense of introducing a measurable function W as 
above. This reduction loses the description of all the intermediate systems built 
(^) 

from the spaces ^ and so, while quicker for deriving various combinatorial 
consequences, it seems to suppress some of the probabilistic features of the 
analysis. 

Our recovery of the above version of the Elek-Szegedy result will be based 
on the following standard fact of probability; see, for example. Theorem 5.10 in 
Kallenberg [4-^)]. We recall a proof for completeness. 

Lemma 4.2 (Noise outsourcing lemma). Suppose that X and Y are standard 
Borel spaces, that /i is a probability measure on X and that P : X Y is 
a probability kernel. Let /iL be Lebesgue measure on [0,1]. Then there is some 
Borel measurable map / : X x [0, 1] — s- y such that, endowing X x [0, 1] with 
the product measure /i (g) /il, the kernel P{x, ■) is a version of the conditional 
distribution of f{x, ■ ) given the first coordinate x: that is, for any A € Tix and 
B € Ey we have 

P{x,B) = ^^,®^,A^{f(x,t)eB}\x\ 
up to ^-almost everywhere equivalence. 

Remark. If we think of the kernel P{x, ■ ) as specifying a random choice of a 
point y ^ Y with law depending on a point x & X, then this lemma tells us 
that the randomness in this choice can be correctly represented by first choosing 
independently and uniformly a value for the 'noise parameter' t € [0, 1] , and then 
choosing y according to some deterministic valued map on pairs < 

Proof. Starting from a probability kernel from X to Y , we will need to build 
a deterministic function with values in Y: for this it will be crucial that Y is 
a standard Borel space, in order that we can specify individual points of it in 
some measurable way in our definition of /. Given this, we can proceed in a 
number of ways. For example, we can first identify Y with a Borel subset of 
[0, 1], and can now specify a suitable / such that for each x the map t f{x, t) 
is non-decreasing by setting 

f{x, t) := sup{(? G Q n [0, 1] : P{x, [0, q]) < t}. 

It is routine to check that this countable supremum is measurable (appealing to 
the measurability in x of the kernel P) and that it has the desired distribution 
(and so must, in particular, take values in Y almost surely). This construction 
is essentially a pointwise-in-a; implementation of the Skorokhod embedding. □ 

Given this, we can quickly reconstruct the Elek-Szegedy representation of 
Theorem 4.1: 
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Proof of Theorem ^.1. Wc will replace the spaces Zi by powers of the unit inter- 
val level-by-level from below. At stage z < fc — 1 we lose some of the information 
contained in a point of Zi, but work instead with the value of some outsourced 
noise parameter in [0, 1], as given by Lemma 4.2; this will allow us to retain 

(^) 

up to stage i the distribution /^^ on ' while replacing all of the lower-rank 
distributions with simpler outsourced noise. Thus we obtain a sequence of inter- 
mediate towers of quasifactors of ^ (after initially also identifying (Zq, /^o) with 
([0, 1], ^l)), represented by the rows in the following array: 



[0, 1] Si Zi 


Z<^2 




Z<,z -A Zz 


Z^k 


Pk 


{0,1} 


[0, ^ Zi 


Z<^2 




^<3 Zz 


Z<k 


Pk 


{0,1} 




[0, 




Z^z Zz 


Z<k 


Pk 


{0,1} 








[0,1]^+^ ... 


Z^k 


Pk 


{0,1} 



where we have written Z^i := Y[a(z[i] '^\°-V When we reach stage k we have a 
representation of yu^ = purely in terms of this noise and one last probability 
kernel; this will be represented by the function W of the Elek-Szegedy Theorem. 

□ 

4.5. Measures on spaces of isomorphism classes and the work of 
Razborov 

Much of the work of this paper has been to describe those probability mea- 
sures on a space if^"^^ of colourings by a /c-palette K of either the subsets 
or the directed hyperedges of a vertex set S that are invariant under finitely- 
supported vertex permutations. Any two points of this space lie in the same 
orbit of this SymQ(S')-action if and only if they are isomorphic as (directed) 
coloured hypergraphs by a finitely-supported vertex permutation, and so we 
may regard the study of such as a natural alternative to working directly with 
probability measures on the space of hypergraph colouring isomorphism classes, 
X(^)/Symo(5). 

Indeed, the quotient by SymQ(S') destroys all of the nice topological and 
Borel space structure of the space K^^\ and so working directly on this space of 
isomorphism classes is more difficult. This topological difficulty seems to appear 
in some guise or other in any attempt to study a limit object for hypergraph 
colourings, even though it might seem more natural a priori to consider these 
only up to isomorphism. 

Exchangeable random colourings of (^j.) or ljj<j, Inj([i], S) are a convenient 
alternative requiring only very classical probabilistic ideas. However, there are 
other ways to circumvent this difficulty. Here we will give an overview of one 
such, and compare it with the study of exchangeability. This is the approach 
the underlies recent work of Razborov [-54, -5.5], and rests on a conversion of the 
necessary data into certain very specialized abstract commutative algebras ('flag 



T. Austin/On exchangeable random variables 



129 



algebras') and M-valucd homoniorphisms on them. In fact, Razborov details his 
construction in the slightly different lexicon of model theory (much as outlined 
briefly in Subsection 3.8 above), and we have made a partial translation into that 
of hypergraph colourings in order to make the comparison with the results of the 
present note simpler. In addition, Razborov is by no means the first to consider 
a notion of exchangeable probability measure on a space of models of a simple 
theory; for example, these appear already in the studies by Gaifman [34] and 
Krauss [51], and their work is then offered as motivation in Hoover's article [42]. 
Rather, the novelty of Razborov's approach is his use to which he puts these 
ideas in the pursuit of certain purely finitary combinatorial results; given the 
relevance of this to the aims of the present survey, we shall concentrate on 
Razborov's chosen route through the theory below. 

Just as for many of the other constructions described in this survey, the study 
of these is motivated by their ability to serve as proxies for the leading-order 
statistics of large graphs or hypergraphs. Razborov's approach emphasizes first 
an abstractly-defined commutative algebra over R defined in terms of isomor- 
phism classes of finite hypergraphs, and then those of its linear homomorphisms 
to K that enjoy a certain positivity property. It turns out that these algebras 
correspond in our picture precisely to certain quotients of the algebra of con- 
tinuous functions on K^^\ and that routine soft functional analytic arguments 
now identify the positive homomorphisms of this algebra with the exchange- 
able probability measures on this space that are 'effectively concentrated on a 
single equivalence class' (that is, are ergodic). This is how Razborov's algebras 
and their homomorphisms give an alternative means for handling the difficulty 
of defining a probability measure directly onto the space of these equivalence 
classes. 

Here we will discuss the relation between Razborov's formalism (set up in 
the early sections of [54]) and that of exchangeable random hypergraph colour- 
ings. For simplicity we will specialize to the case of undirected ii'-coloured hy- 
pergraphs for some fc-palette K, so that the notation If '"^^ will be used as in 
Subsection 3.3. The new notation we shall introduce to describe flag algebras 
follows [54]. Razborov's work actually applies in the quite general setting of 
models of theories outlined in Subsection 3.8; a more detailed discussion in this 
greater generality can be found in [9] . 

Razborov goes on to use his machinery to conduct a variational analysis of 
a problem in graph theory, an instance of the 'graph copy problem' relating 
to edge-versus-triangle densities in large graphs (see also Subsection 4.6 below). 
This leads to a quite specific optimization problem (in some ways similar to that 
appearing in Fremlin and Talagrand [29]) studied in Sections 4 and 5 of [54], and 
then analyzed completely in [55]. While they too can presumably be translated 
into the measure theoretic picture, this actually seems to change the arguments 
involved very little. 

Let us first consider isomorphism classes of those finite if-colourings that 
contain a distinguished copy of some fixed finite X-colouring a; that is, pairs 
F = (M, 9) with Af a finite iC-colouring and 9 an embedding a ^ M. These 
isomorphism classes of extensions are examples of the 'a-flags' of [54], and T'^ 
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is written for the collection of all them, for the subcoUcction of those with 
vertex set of size i (so Tf = unless £ > k), and we can now specify the obvious 
notions of embedding and isomorphism for a-flags. 

Let us restrict our attention to cr-flags F = (M, a) with AI £ K^^^^^^ for some 
i > 1 and a itself serving as its distinguished copy in M (so we implicit order 
the vertices of M so that a = Af\[r] for some r). Let := K^^\ let ficr be the 
subset of those oj G such that uj\[r] = u, and now associate to each F e JF'^ 
the finite-dimensional cylinder set 

Af ■■= {uj e fl : = M} C $1^ 

(so ficr = ^(<T cr) ; the difference in notation reflects the different roles of cr and 
F). 

Let us write C(Ocr) and A4{ilcr) for the usual Banach spaces of real- valued 
continuous functions and signed Radon measures on fl^ respectively; the Riesz- 
Kakutani representation identifies A4{na) isometrically with C{^la)*- Moreover, 
we write M'^ for the subspace of measures supported on fl^ and invariant under 
finitely-supported permutations of the vertices in N \ [k] ; we will call these <j- 
exchangeable, and denote the group of these permutations by Sym^^ (so this is 
just the stabilizer of 1, 2, . . . , fc in Symg(N)). We write {Ai'^)-^ for the annihilator 
of this space of continuous linear functionals in C(ri(j), 

{Mn^ = {/ e C{n„) : (/, fi)=OVfie M'^Y 

as usual, the dual-of-the-quotient Banach space (C(fio-)/(A^'^)^)* can be iso- 
metrically identified with M'^ . Let q^, be the quotient map C(i7(j) — > 
C(f2<,)/(7W-)^. 

Given any / g 0(^1,,), we must have f — f o g G (jM°')^ for all g £ Sym^.. 
Write Tcr for the tail cr-subalgebra nm>fc+i ^{'^[k]u{m,m+i, ...}), and (with a slight 
abuse of notation) L°°{T„) for the bounded T^-measurable functions under the 
equivalence relation of "equality fi-a.e. for every /z S A^'^". Clearly these are 
invariant under the action of Sym^.. By (for example) the pointwise ergodic 
theorem for the amenable group Sym^., the ergodic averages of the compositions 
fog over g converge to a To.-measurable function / on fJo- which is defined /z- 
almost everywhere for every fi £ M'^ and is invariant under Sym^., and hence 
actually specifies a member of L°°{T^). Observe that / = /i for /, ft. G C{fla) 
if and only ii f - h G (A^'')-'-, and so our map f ^ J embeds C{p.cr) / [M'")-^ 
as a subspace of L°°(T^); general nonsense now shows also that this is an 
isometric embedding, so is closed. 

Furthermore, V is actually a snhalgebra of L°°(To-). To see that it is closed 
under multiplication, suppose f,h £ C(f2o-), and now consider the products 
f ■ {h o g) for any sequence of permutations g that pushes h 'further and fur- 
ther out', in the following sense: for any m > 1, there are finite A, B C N \ [r] 
such that / and h are uniformly (1/to)-c1osc to functions depending only on 
the colours of edges above vertices in A and B (respectively), and now we insist 
that g move B into N \ (A U [r]). Letting m ^ oo this gives a sequence gm for 
which, in terms of their dependence on coordinates, / and ho gm are closer and 
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closer to independent. Now an elementary argument using approximation by 
step functions shows that the quotients q^^f - hog) converge in C{VL„) / [M'^)^ 
to a member that depends only on Qaif) and qa{h); and it is a routine exer- 
cise to check that this actually defines a C*-algebra product on C{fla)/ {M°')'^ 
corresponding exactly to the usual product of functions in . 

Thus we have identified C{n^)/{M'')^ with a closed subalgebra V of L°°(T^) 
(with a newly-defined product). Let us call functions in V"' simple if they are 
the images of simple (equivalently, finite-dimensional) functions in C{ila)- We 
can describe the simple functions naturally as follows: to any fixed nonempty 
cylinder set A C depending on coordinates in J C N \ [fc] corresponds a 
collection of finite models of the theory T on the vertex set J (with some mul- 
tiplicities), and now the averaged- function 1^4 (w) for lj G fio- is just the sum 
of the densities with which each of those finite models appears isomorpliically 
as a submodel of oj (now summing over the multiplicities). Referring to such a 
function 1^ for A corresponding to a single model on J (so that our general A 
is a disjoint union of such) as a statistics function, the simple fimctions in 
are now just linear combinations of statistics functions. We write Vq for the 
dense subspace of these. 

We now begin our account of flag algebras themselves. Let KJT'^ denote the 
free M-vectorspace on T'^. We modify the correspondence F Ap intro- 
duced above to associate to a member J2j ^kFj S M.F'^ a corresponding linear 
combination of the indicator functions l^i^ of these Ap, with some suitable 
renormalizing constraints (which depend on from Razborov's choice of formal- 
ism): 



The step functions that appear above are continuous on fl^ since each Ap is 
clopen, and so this deflnes a linear operator <I> : M.T"' (^(fJ^) with image some 
peculiar subspace contained within the space of simple functions in C(ricr)- 

The arbitrariness in our choice of the subsets Ap corresponding to F is re- 
flected in a similar arbitrariness in the linear map $ into C'(fla)', however, this 
disappears at the next step, when we define a flag algebra as a quotient of 
RjF'^. Let be the subspace of M.J^'^ generated by the linear combinations 
F — ^p^j7^ p{F,F)F for different £ > \V{F)\, and for certain real numbers 

p{F,F) (given in Definition 1 of [54]). It turns out that, given our chosen nor- 
malization in the definition of <!> above, the values p{F, F) are such that $(/C'^) 
is precisely the set of those a € for which <I>(a) G C{ila-) is annihilated 

by every exchangeable probability measure on flcr , and thus by all finite signed 
measures obeying the vertex-permutation symmetry (this is discussed in a little 
more detail in [9]). 

We now consider the quotient space 21*^ := MJF'^//C'^, this is Razborov's flag 
algebra. By the above, qa°^ factors through this quotient to give an injective 
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map : W ~> . Moreover, since any finite dimensional cylinder set contained 
in can be identified with some Ap upon a suitable permutation of coordinates 
in N \ [k] , the image of ^' is actually the subspace Vq of all simple functions in 
V^; as such, it is dense. 

Under the above identification the image of Razborov's definition of the prod- 
uct of a,6 G M.!F°' /K."' is now the product of ^'(a) and ^{b) as L°°-functions on 
fl^l the proof in [o-l] that this product is well-defined translates into a proof in 
the exchangeability picture that this product remains in the image of \1/ (that 
is, in Vq). This completes our identification of the fiag algebra St"' with the 
dense subalgebra Vq of , which is itself a norm-closed Banach subalgebra of 

The overall approach during the early stages of [54] is to define fiag alge- 
bras first (Section 2), and then to specify a collection of 'limit objects' for the 
statistics of large models of a theory as certain homomorphisms of these flag 
algebras (Section 3). Thus, the next step is to consider multiplicative functions 
: St"' ^ IR that are non- negative on the image of any single fiag F G 
the set of these is written Hom^(2t"', M). Now, having identified St" with the 
dense subalgebra Vq C F" so that the images of single flags correspond to 
the single statistics functions, we can easily check that non-negativity on these 
implies non-negativity on any member of Vq which is itself non-negative as a 
real-valued function. Therefore, given the non-negativity of such a (j), it follows 
that it must be bounded as a linear functional on Vq , and so can be extended 
to a multiplicative linear functional on V"' . 

We could now, if we wished, apply certain standard representation theorems 
to this space (perhaps most directly the results of Yosida and Kakutani for M- 
spaces, as presented in Section XII. 5 of [66]). This would identify cf> with a point 
of the spectrum of V"' (for one or other interpretation of 'spectrum'). In fact, a 
similar observation is more-or-less implicit in Remark 4 of Subsection 3.2 of [54], 
although there we still require some of the basic structure of Hom^ (21" , R) to 
have been identified. 

However, given our identification of 2t" with Vq , we have an alternative to 
the above. Since = C{ilcr)/{M°')-^ , as a linear functional on we can 
identify cj) with a member of A^" (uniquely, since {C{^la)/ {M'^)'^)* = M"^): a 
Sym^-invariant measure on the space f^o- that we started with, rather than a 
point of some abstractly-produced new space Spec V". It is now easy to check 
that those measures in A^" that are multiplicative on V" are precisely the er- 
godic Sym^ -invariant probability measures on fl„ (since any member of is 
/x-a.s. constant if € Al" is ergodic). Now the order defined on 2t" in Definition 
5 of Section 3 of [54] is precisely the usual pointwise order on Vq as a set of real- 
valued functions; in the setting of abstract fiag algebras, where we are unable 
to define anything 'pointwise', the functionals of Hom^ (St" , M) are needed as a 
replacement to formulate this definition. This constitutes the promised corre- 
spondence between exchangeable random iC-coloured hypergraphs (on N, say) 
and the homomorphisms of Razborov's theory. 

Let us finish this subsection by noting that while Razborov's work seems 
to come closest to developing a theory of 'leading-order statistics' in terms of 
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probability measures or other structures defined directly on a space of isomor- 
phism classes, there are precedents for such an approach among the study of 
certain other combinatorial structures: again, these can be found in the 'ob- 
jective method' mentioned in passing in Subsection 4.1 above and recounted 
in Aldous and Steele [5]. Perhaps the simplest instance of the extraction of a 
limit object as a measure defined directly on a space of isomorphism classes of 
infinite models occurs in the work on limit objects for various classes of sparse 
rooted graph (see, in particular, the papers of Aldous and Steele [5], Aldous and 
Lyons [4], Benjamini and Schramm [12], Benjamini, Schramm and Shapira [13] 
and Elek [-■>]). For example (as in [12]), in the setting of all rooted graphs (G, o) 
satisfying some fixed maximum-degree bound d>l, one can associate to such a 
graph G a probability measure /ic directly on the set of all isomorphism classes 
of such rooted graphs of maximum degree at most d, by defining the probability 
of the class of a given such graph {H, v) to be the proportion of possible roots 
o G V{G) for which (G, o) = {H, v). The set of all such isomorphism classes can 
be endowed with a natural compact metrizable topology by defining a neigh- 
bourhood base as those subsets of equivalence classes for which a ball of a given 
finite radius around the root lies within a given isomorphism class, and we may 
now take vague limits of these measures fie for suitable sequences of G. This 
ability to study a natural — albeit, perhaps, not easily visualized — compact 
metrizable topology on the space of isomorphism classes distinguishes this set- 
ting from that of dense graphs. On the other hand, in the latter setting the 
possibility of working in the much larger space of all fc-uniform hypergraphs on 
N {not quotiented by the isomorphism relation) is available, and makes contact 
with the classical theory of exchangeability; it is not clear that the setting of 
bounded-degree graphs admits a similarly nice class of 'labeled' models on which 
we could study those probability measures invariant under automorphisms. 

4-4- Comparison with finitary regularity lemmas 

Since Szemeredi's introduction of his original graph regularity lemma on the 
way to his proof of Szemerdi's Theorem ([til]), it has become perhaps the single 
most powerful tool in extremal graph theory (see, for example, Sections IV. 5 and 
IV. 6 of BoUobas [16]). More recently, various regularity lemmas have emerged 
for hypergraphs with enough strength to reproduce many of the results from 
the graph case: in particular, those of Nagle, Rodl and Schacht [53] and of 
Gowers [30]. These results can be described loosely as guaranteeing that to an 
arbitrary very large fc-uniform hypergraph there must correspond some smaller 
non-uniform hypergraph, of order bounded only in terms of an a priori fixed 
error tolerance, and some [0, l]-weights on its hyperedgcs, such that a certain 
normalized weighted count of embedded copies of a given small hypergraph in 
the latter is a good approximation to their normalized count (the 'density') in 
the former (we shall not make this more precise here). They all rest on one or 
another notion of 'quasirandomness' for hypergraphs. 

It emerges naturally in all the higher-rank regularity lemmas that in order 
to obtain a good approximation to the densities of small embedded subgraphs 
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in a very large fixed fc- uniform hypergraph (V,iJ), one must first introduce a 
partition of the vertex set that controls only certain statistics pertaining to 
the (fc — l)-liyperedges over V ^ and then given this information obtain a much 
finer partition that controls statistics pertaining to H and to this first partition 
through the (fc — 2)-hyperedges over V , and so on; the final partition of V is 
obtained after k such steps. This feature is present in different ways in the 
Naglc-Rodl-Schacht and Gowers regularity lemmas. Gowers' approach, which is 
tailored very precisely to the needs of proving hypergraph removal lemmas (and 
thus also Szemeredi's Theorem), rests on the use of 'Gowers norms' to control the 
count of 'octahcdra' in a hypergraph (for which wc introduced infinitary analogs 
in Subsection 3.7 above). On the other hand, Nadle, Rodl and Schacht obtain an 
iterative control of the correlations of the original fc-uniform hypergraph with 
^-uniform hypergraphs for i < k, starting at i = k and working downwards. 

A clear discussion of the similarities and differences in these two finitary 
hypergraph regularity lemmas can be found around Definitions 2.5 and 2.6 and 
in Section 10 of Gowers [.SS]. 

We recall these observations here because this picture of exerting control from 
the top rank downwards — particulary in the form it takes in Nagle, Rodl and 
Schacht 's work — is reminiscent of the downwards- inductive construction of our 
ingredients, in which each of the resulting probability kernels must be allowed 
to depend on type spaces of all ranks beneath it. 

4-5. Property testing, repairability and joinings 

A more specific class of questions that can be set up roughly in parallel for 
finite (directed or undirected) hypergraph colourings and for their exchangeable 
random counterparts is that of testing and local repair of hereditary hypergraph 
properties. The basic questions of property testing have attracted considerable 
attention during the last ten years, and we shall discuss them only very incom- 
pletely here; sec, for example, the papers of Rubinfeld and Sudan [")7], Alon 
and Shapira [8] and Alon. Fischer, Krivclcvich and Szcgedy [■ ;], and the further 
references given there. 

In this subsection we shall work with coloured directed hypergraphs, as we 
will in [fO]. Suppose that if is a finite fc-palette and that 7^ is a property of 
if -coloured directed hypergraphs: that is, it is a subcoUection of the collection 
of all isomorphism classes of finite if-coloured directed hypergraphs. In this 
case, given a vertex set V we shall write p(^) for the set of all if-coloured 
hypergraphs on V satisfying V (that is, members of if whose isomorphism 
class is in V), and shall write that the members of p(^) obey V. 

Definition 4.3 (Hereditary property). A property V as above is called hered- 
itary if whenever G € p^^") and W V then the induced if -coloured hyper- 
graph G\w on W is in T'^^). 

In case V is hereditary, wc may naturally extend its definition to coloured 
directed hypergraphs G on infinite vertex sets V by specifying that G obeys V 
if and only if all of its finite induced subgraphs do. 
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Definition 4.4 (Testability). [57] Let if be a finite fc-palette and let 7^ be a 
hereditary if-coloured directed hypergraph property. We say that V is testable 
with one-sided error if for every e > there are an integer A'' > 1 and a real 
number 5 > Q for which the following holds: if G G K'^^'^ is a if-coloured 
hypergraph on a vertex set V with with < < oo, and G 'locally almost 
obeys in the sense that 



In the case of undirected {0, l}-colourcd graphs or hypergraphs, it has re- 
cently been shown in work of Alon and Shapira [7] and Rodl and Schacht [56] 
that every hereditary property is testable. Their arguments are purely finitary, 
each resting on several applications of a graph or hypergraph regularity lemma 
to 'process' a given hypergraph (although the details of their approaches are very 
different). However, reconsidering these results in the setting of infinitary ex- 
changeable random hypergraph colourings shows some interesting features, and 
also proves convenient for several further extensions of the combinatorial results 
to directed, non-uniform hypergraph colourings with larger finite palettes. This 
approach will be elaborated in detail in [10], where the main positive result is 
the assertion that all hereditary properties in the full generality of Definition 4.4 
are testable. 

In [10] we will also introduce certain related notions of 'local repairability', 
which amount to additional conditions on the manner in which a suitable mod- 
ification G' is to be foimd in Definition 4.4. In particular, they demand that 
there be some randomized procedure for modifying G to G' that modifies the 
colour of a given edge in G according to the behaviour of G across only that 
edge and some bounded number of other vertices, these latter chosen at random, 
and that yields a suitable modification with high probability. These notions of 
'local repairability' take various forms, both in terms of finite coloured directed 
graphs or hypergraphs and with related versions for their exchangeable random 
cousins. We will not elaborate on the different forms of 'local repairability' here, 
referring the reader to [10] instead, but will restrict ourselves to an informal 
statement of the infinitary counterpart of simple testability as illustration of 
these relationships. 

Thus, let if be a finite fc-palette, let 7^ be a hereditary property of if-coloured 
directed hypergraphs. We will write that V is infinitarily testable with one- 
sided error if whenever /i is an exchangeable random if -coloured directed 
hypergraph on a countably infinite vertex set S whose sample points obey V 
almost surely and £ > is an error tolerance then the following holds. If we 
extract the structural ingredients guaranteed by Theorem 3.23: 




then there exists G' e T'^^) which is close to G in the sense that 
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• a fc-palcttc Z ~ {Zi)^^Q consisting of totally disconnected compact metric 
spaces, 

• continuous maps Ki : Zi ^ Ki^ 

• and an exchangeable random Z-coloured hypergrapli under which the 
clusters of random variables (7r^)</)einj([i].M) for distinct u G ('^) are rela- 
tively independent when conditioned on all the random variables tt^ with 
ifj € Inj(< i, S), and ^' satisfies (kq, ki, ■ ■ ■ , Kfe)^ /i' — Mi 

then there exists a Symg-covariant probability kernel T : Z^^^ A'''^' such that 

• the map (z0, (zi)ige, . . .) T{z[d, {zi)i^e, ■ • ■ , • ) is vaguely continuous from 

into Pr 

• for each W C S, the projected measure (tt/^, )^(r(z0, (zi)ig5, 

. . . , • )) depends only on (zg, ■ • ■), 

• the probability measure T(z0, (2^)^55, ...,•) is concentrated on V'--^^ for 
every (z©, (zi),gs, . . .) G 

• and T is close to k under /i' in the sense that 

J^^^t[z, [n(^^^^ j^.(j^j^^j) "'(/V \ {{ko{z,), («i(z.)).ee, ■ ■ •)})) 

for any e € (^) (the choice being irrelevant by exchangeability). 

The proof of the positive results of [10] will rely on a 'correspondence princi- 
ple', which asserts that a hereditary property V is testable with one-sided error 
if it is infinitarily testable with one-sided error, and similarly for the stronger 
finitary and infinitary notions of 'strong local repairability'. Having established 
this principle, the bulk of the work of [ lO] will go into proving infinitary testa- 
bility, using the formalism of exchangeable random colourings and the structure 
theorem 3.23. Interestingly, we find that both finitary and infinitary strong local 
repairability can fail for some hereditary properties, but they do always hold in 
the presence of various additional assumptions (for example, in rank 2 or if the 
property is actually monotone). 

This approach to proving testability results has already appeared in the spe- 
cial case of hypergrapli removal lemmas in Tao [(i2], making only partial use of 
the full structure theorem for exchangeable random hypergraphs. 

4-6. Extremal problems 

Szcmercdi's Theorem asserts that any positive-density subset of a sufficiently 
long discrete interval contains long arithmetic progressions. By contrast, the 
analogous search for copies of a given small hypergraph in a large dense hy- 
pergraph is typically not a foregone conclusion, and the nature of our question 
changes: rather than ask whether every positive-density hypergraph must con- 
tain embedded or induced copies of a fixed small hypergraph F, we instead turn 
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to the classical Turan question of how dense our large hypcrgraph must be in 
order to guarantee the presence of such subhypergraphs. 

However, structurally this adds a new kind of complexity; for in the typical 
case in which the Turan density lies in (0,1), identifying it precisely usually 
requires also a precise description of the density-extremizers among those hy- 
pergraphs that do not contain any embedded copies of the small hypergraph. 
Thus we must delve into the structure of individual hypergraphs in order to find 
these extremizers (at least up to their leading-order statistics), and so establish 
the value of the critical density. To date, this program has been successfully 
carried out for graphs, but only a handful of results are known for higher-rank 
hypergraphs; see, for example, the survey by Sidorenko [60] and the construction 
of conjectured edge-density extremizers among 3-uniform hypergraphs contain- 
ing no tetrahedron in Kostochka [50] and (for a nice systematization of the 
Kostochka examples) Fon-Der-Flaass [2G]. 

Such extremal questions also have natural formulations for exchangeable 
random hypergraphs. Let H he a, fixed finite /c-uniform hypergraph on some 
finite vertex set / C N; identify H with some i] G {0, We wish to 
know what is the supremal (5 > (the Turan density of H) such that 
there exists an exchangeable random fc-uniform hypergraph ^ on {0, ll^*:) with 
/^('^a2,...,fc}{l}) = ^ and 

li{uj G X : t^l^^) > = 0. 

Note that it is clear by compactness and continuity that some fi attains this 
supremum. We refer to ii{'k^^^ fc}{l}) ^^'^ edge density of fi; and will 
say that is H-iree in case the second ^-probability above is zero. Simply by 
approximating such random hypergraphs by finite hypergraphs one sees that 
the resulting Turan density of H is the same in either setting. 

One can also ask the induced variant of this question, for which the condition 
'j^l(^) > J? is replaced by = V- 

Furstenberg's ergodic-theoretic approach to Szemeredi's Theorem can help 
to find a proof because we can reduce that theorem to a result about all non- 
negligible subsets of a probability-preserving system. On the other hand, the 
problem of understanding the structure of possible extremizers in the hyper- 
graph setting, although easily formulable in both finitary and infinitary settings, 
is not clearly more vulnerable to attack in the latter than in the former. The 
structural ingredients of an exchangeable random hypergraph do offer a possi- 
bly richer collection of data over which to optimize when seeking to isolate the 
Turan extremizers, and, moreover, some of which can be varied continuously. 
However, it is not clear when this additional manoeuvrability might actually 
make the search any easier. One important instance of a related question in 
which such an infinitary formalism does seem to be more manageable is that 
studied by Razborov in [-54, -55], where the interest is in what possible pairs of 
values {6i, 62) € [0, 1]^ can appear (up to OAr(l) corrections, for graphs with very 
large numbers of vertices N) simultaneously as the edge- and triangle-densities 
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of a very large finite graph; Razborov uses a variational analysis on the objects 
of a different itary formalism (introduced in Subsection 4.3 above) to answer 
this question completely. 

Finally, we note that such a variational approach to the structure of ex- 
tremizing measures appears already in Section 4 of Fremlin and Talagrand [29] . 
Indeed, the motivation for their work is the analogous extremal questions for 
spreadable random graphs (see Subsection 3.9 above) for which the slightly 
weaker symmetry results in slightly different critical densities. The hypothesis 
of spreadability does not seem to relate to any simple finitary situation through 
a correspondence principle as in Subsection 2.3, and this may be partly why the 
Fremlin- Talagrand analysis seems to be largely unknown in the combinatorial 
community. 

4-7. Broader context in ergodic theory 

An interesting viewpoint of the correspondence between statistics of large dense 
hypergraphs and exchangeable random hypergraph colourings, which we have 
not explained previously, is that it fills a suggestive lacuna among some of the 
known approaches to Szemeredi's Theorem. 

On the one hand, two separate 'purely combinatorial' proofs of Szemeredi's 
Theorem are now known, both relying on strong regularity lemmas: Szemeredi's 
original proof ([Gl]) led to his introduction of the regularity lemma for graphs, 
and more recently hypergraph extensions of the regularity lemma have been 
introduced by Naglc, Rodl and Schacht [53] and by Gowcrs [.-]{>] and used to 
give a different combinatorial argument. 

On the other hand, in 1977 Furstenberg [30] gave another proof, superfi- 
cially very different, relying on a translation of the problem into the highly 
infinitary language of ergodic theory via a another correspondence principle, 
followed by an analysis of the relevant types of behaviour that an arbitrary 
measure-preserving system can display. This latter analysis relies on a 'structure 
theory' for probability preserving Z-systems, developed by Furstenberg in [30] 
and by Zimmer in [69, 68]. An accessible introduction to the extent of this the- 
ory needed for a proof of Szemeredi's Theorem can be found in Furstenberg's 
book [31]. 

In essence, Furstenberg and Zimmer show how to extract from a probability- 
preserving Z-system (X, E, /i, r) a (possibly transfinite) tower of factors 

X ^ ■ ■ ■ ^ Y,i ^ ■ ■ ■ ^ Y2 ^ Yi ^ Yq 

such that Yo is the invariant factor, for any limit ordinal ?/ the factor Yjj is the 
inverse limit of its predecessors, and for each 77 the extension ^,,+1 — > takes 
a certain 'primitive' form: it is either 'relatively weakly mixing' or 'relatively 
compact'. Relative weak mixing tells us that given a pair of functions /, 5 G 
L°°(y^+i), / and 5 o r" are asymptotically relatively independent over Y,j (in a 
certain weak sense) as n — *■ cx3. Relative compactness is a counterpoint to this, 
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according to which the shifts gor" of the function g e arc constrained 

in the forms they can take relative to the subfactor as n — ^ oo. 

During the long period of research since these early papers it has become 
clear that many of the ideas underlying these two approaches to Szemeredi's 
Theorem are analogous. It turns out that the exchangeability theory can be 
seen as making this analogy a little more concrete, by offering an infinitary 
picture that relates to hypcrgraphs (via the correspondence principle of Sub- 
section 2.3) as do measure-preserving Z-systems to the additive combinatorics 
of Szemeredi's Theorem, and in which the structure theorems for exchangeable 
random hypcrgraph colourings then take the place of the Furstcnberg-Zimmcr 
tower. 

However, notwithstanding the analogy outlined above, our structural analysis 
differs from that for Z-actions in some instructive ways. 

Firstly, while the work of Furstenberg and Zimmer applies to an arbitrary 
probability-preserving group action (although some further refinements are 
needed to extract the multidimensional Szcmcredi Theorem; see [32]), in the 
setting of exchangeable random hypergraph colourings or set-indexed systems 
we are considering only a very special restricted class of probability-preserving 
Symg(5')-actions, precisely because we insist that our measures live on the prod- 
uct space K"^ and be invariant under coordinate permutations. Such special 
cases are very far from representing a generic SymQ(S')-systcm. 

Secondly, and perhaps more curiously, our structural description necessitates 
a slight extension of the notion of factor that appears in the Furstenberg-Zimmer 
theory: we work instead with a tower of 'quasifactors' (for which the relevant 
probability kernel is not directly recoverable from the original system, but only 
after our trick of introducing additional 'auxiliary vertices'), and the second 
example of Subsection 3.6 shows that this is really necessary. Each quasifactor 
in our tower has a simple description in terms of its predecessors, but in general 
those predecessors may not be recoverable as factors of the original system. 
While quasifactors certainly do play a role in general ergodic theory also (see, 
for example. Chapter 8 of Glasner [35]), it seems that to date the basic ergodic 
theoretic analyses of questions of multiple recurrence (or, relatedly, convergence 
of nonconventional ergodic averages, as in [43, G7]) has always proved reducible 
to the study of true factors of the original system. 

However, perhaps the most instructive difference is in the basic constructions 
that under ly the proofs. Both the exchangeable law structural results and the 
Furstenberg-Zimmer theory rest on our ability to extract, given a suitable ex- 
changeable or stationary law on the product space K"^ (with T = Z in classical 
ergodic theory), a F-invariant cr-subalgebra T of such that, firstly, T enjoys 
some strengthened symmetry or other additional structure under the law (up 
to negligible sets), and secondly the coordinate projections ttj enjoy some addi- 
tional regularity (such as relative independence or some weaker notion related 
to it, such as relative weak mixing) when conditioned on T under /i. Typically a 
'composition series' comprising several nested such cr-subalgcbras is called for. 

Of course, the details of these additional demands made on the cr-subalgebra 
vary among different notions of exchangeability, and are different again in the 
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ergodic-theoretic setting — indeed, much of the ingenuity behind each theory 
hes in the selection of the right demands to impose. However, in some sense 
the most basic difference between the methods of exchangeabihty theory and of 
Furstenbcrg-Zimmer theory hes in how these a-subalgebras are specified. 

In the case of exchangeable processes, the decisive observation for solving our 
problem is that we can embed a copy of the index set T as an infinite-coinfinite 
subset Ti of itself, so that the subprocess {TTt)t£Ti is effectively just a copy of the 
original process and so that the remainder process (7rt)tgT\Ti ^^^"^ ^^'^^^ enjoys 
enough symmetry that we can construct a suitable auxiliary process (tt^ 
simply by grouping together suitable further subprocesscs of this remainder 
process. The clusters of random variables obtained from this grouping then 
specify directly the factor needed to describe the subprocess {■Kt)t£Ti, and so 
by our earlier identification they also tell us the law of the original process 
{■Kt)t^T- Thus, with the right sleight of hand, we can in this setting specify our 
auxiliary process directly by grouping into subprocesses random variables from 
the original process. 

In practice, this approach clearly relies on the ability to move the coordinate 
projections tt^ about by the action T r\ T very flexibly; in particular, in the 
proofs of the relative-independence assertions of Propositions 3.4 and 3.12 we 
made crucial use of our ability to move some vertices around almost arbitrarily 
while leaving certain other vertices fixed. 

In ergodic-theoretic problems, however, we must usually do without this 
'overwhelming strength' of the action of F on T. In this setting, is is there- 
fore more common to obtain the a-subalgebra T (or, equivalently, the random 
variables 7rf or the factor they specify) as suitable limits of functions of several 
of the original projections TTj, either by averaging over the action of F (which is 
usually amenable) or sometimes by taking some more exotic kind of limit, such 
as the 'IP-limits' that under Furstenberg and Katznelson's analysis of general 
'IP systems' in [33]. This approach generally requires more hard analysis, firstly 
to show that our averages or limits defining the tt^ exist at all, and then that 
they specify an auxiliary process that still enjoys the desired properties (since 
this may not now be so easy to read off from the corresponding properties of 

(7rt)tGT). 

Often — particularly in ergodic theory — there is more than one way to 
extract these auxiliary processes, and our knowledge of structural results has 
improved over time owing to refinements to the techniques that are available. 
In particular, the original Furstenbcrg-Zimmer structural approach — while 
still the basic foundation of the analysis of multiple recurrence phenomena for 
Z-actions — has gradually been complemented by a much more precise de- 
scription of those 'factors' of a probability-preserving Z-system that govern the 
convergence behaviour of various 'nonconventional ergodic averages' (which are 
somewhat analogous to individual products of observables in our setting). We 
direct the reader, in particular, to the two slightly different treatments of these 
'characteristic factors' by Host and Kra [43] and by Zicgler [G7]. 

As a result of this difference in extraction-method, the Furstenberg-Zimmer 
structure theory (and its later relatives) uses factors built from the 'bottom up': 
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starting with the raw Z-action, a tower of factors is built recursively upwards, 
each obtained by an extension of its predecessor (and with inverse limits taken 
up to limit ordinals, if desired). The resulting description of the overall system 
is much less precise than in the exchangeability theory: the Furstenberg-Zimmer 
structure theorem yields a large tower of 'relatively compact' extensions (a very 
special kind), followed by one last 'relatively weakly mixing' extension (about 
which we know very little, except that it is irrelevant for the estimation of 
various nonconventional averages). 

In contrast, our structure theorem for an exchangeable random hypergraph 
colouring is proved from the top-down: starting from an invariant random hy- 
pergraph colouring /i of rank k, we pass in one step down to a quasifactor that 
is itself a rank-(fc— 1) random hypergraph colouring, and show that the original 
/X enjoys the desired relative independence of the random variables tTe over this 
quasifactor (a much more precise condition than 'relative weak mixing'). After 
iterating this argument k times, we have obtained the whole structure of the 
original /i, leaving only very completely-described degrees of freedom at each 
step. 

Naturally the above distinction is not precise, nor does it completely account 
for all manners of specifying cr-subalgebras that appear in this field. In partic- 
ular, many of the constructions on the ergodic theory side can be recovered by 
considering instead a suitable invariant cr-algebra for some measure-preserving 
action, either within the original system or within some extension of it (the 
work [43] of Host and Kra providing a particularly striking example of such an 
approach). However, such a a-subalgebra is still precisely that which is gener- 
ated, up to //-negligible sets, by the abovementioned ergodic averages; and such 
averages inevitably then surface elsewhere in the proof that these cr-algebras 
have the desired properties, so this difference seems to be rather cosmetic. 
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